Bilinear attention networks
Attention networks in multimodal learning provide an efficient way to utilize given visual
information selectively. However, the computational cost to learn attention distributions for …
information selectively. However, the computational cost to learn attention distributions for …
Inverted residuals and linear bottlenecks: Mobile networks for classification, detection and segmentation
In this paper we describe a new mobile architecture MobileNetV2 that improves the state of
the art performance of mobile models on multiple benchmarks across a spectrum of different …
the art performance of mobile models on multiple benchmarks across a spectrum of different …
A gift from knowledge distillation: Fast optimization, network minimization and transfer learning
We introduce a novel technique for knowledge transfer, where knowledge from a pretrained
deep neural network (DNN) is distilled and transferred to another DNN. As the DNN …
deep neural network (DNN) is distilled and transferred to another DNN. As the DNN …
The one hundred layers tiramisu: Fully convolutional densenets for semantic segmentation
State-of-the-art approaches for semantic image segmentation are built on Convolutional
Neural Networks (CNNs). The typical segmentation architecture is composed of (a) a …
Neural Networks (CNNs). The typical segmentation architecture is composed of (a) a …
Deep pyramidal residual networks for spectral–spatial hyperspectral image classification
Convolutional neural networks (CNNs) exhibit good performance in image processing tasks,
pointing themselves as the current state-of-the-art of deep learning methods. However, the …
pointing themselves as the current state-of-the-art of deep learning methods. However, the …
Gaussian error linear units (gelus)
We propose the Gaussian Error Linear Unit (GELU), a high-performing neural network
activation function. The GELU activation function is $ x\Phi (x) $, where $\Phi (x) $ the …
activation function. The GELU activation function is $ x\Phi (x) $, where $\Phi (x) $ the …
Beyond finite layer neural networks: Bridging deep architectures and numerical differential equations
Deep neural networks have become the state-of-the-art models in numerous machine
learning tasks. However, general guidance to network architecture design is still missing. In …
learning tasks. However, general guidance to network architecture design is still missing. In …
Deep pyramidal residual networks
Deep convolutional neural networks (DCNNs) have shown remarkable performance in
image classification tasks in recent years. Generally, deep neural network architectures are …
image classification tasks in recent years. Generally, deep neural network architectures are …
[HTML][HTML] Toward an integration of deep learning and neuroscience
Neuroscience has focused on the detailed implementation of computation, studying neural
codes, dynamics and circuits. In machine learning, however, artificial neural networks tend …
codes, dynamics and circuits. In machine learning, however, artificial neural networks tend …
Few-shot steel surface defect detection
Deep learning-based algorithms have been widely employed to build reliable steel surface
defect detection systems, which are important for manufacturing. The performance of deep …
defect detection systems, which are important for manufacturing. The performance of deep …