Bilinear attention networks

JH Kim, J Jun, BT Zhang - Advances in neural information …, 2018 - proceedings.neurips.cc
Attention networks in multimodal learning provide an efficient way to utilize given visual
information selectively. However, the computational cost to learn attention distributions for …

Inverted residuals and linear bottlenecks: Mobile networks for classification, detection and segmentation

A Howard, A Zhmoginov, LC Chen, M Sandler… - Proc. CVPR, 2018 - research.google
In this paper we describe a new mobile architecture MobileNetV2 that improves the state of
the art performance of mobile models on multiple benchmarks across a spectrum of different …

A gift from knowledge distillation: Fast optimization, network minimization and transfer learning

J Yim, D Joo, J Bae, J Kim - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
We introduce a novel technique for knowledge transfer, where knowledge from a pretrained
deep neural network (DNN) is distilled and transferred to another DNN. As the DNN …

The one hundred layers tiramisu: Fully convolutional densenets for semantic segmentation

S Jégou, M Drozdzal, D Vazquez… - Proceedings of the …, 2017 - openaccess.thecvf.com
State-of-the-art approaches for semantic image segmentation are built on Convolutional
Neural Networks (CNNs). The typical segmentation architecture is composed of (a) a …

Deep pyramidal residual networks for spectral–spatial hyperspectral image classification

ME Paoletti, JM Haut… - … on Geoscience and …, 2018 - ieeexplore.ieee.org
Convolutional neural networks (CNNs) exhibit good performance in image processing tasks,
pointing themselves as the current state-of-the-art of deep learning methods. However, the …

Gaussian error linear units (gelus)

D Hendrycks, K Gimpel - arxiv preprint arxiv:1606.08415, 2016 - arxiv.org
We propose the Gaussian Error Linear Unit (GELU), a high-performing neural network
activation function. The GELU activation function is $ x\Phi (x) $, where $\Phi (x) $ the …

Beyond finite layer neural networks: Bridging deep architectures and numerical differential equations

Y Lu, A Zhong, Q Li, B Dong - International Conference on …, 2018 - proceedings.mlr.press
Deep neural networks have become the state-of-the-art models in numerous machine
learning tasks. However, general guidance to network architecture design is still missing. In …

Deep pyramidal residual networks

D Han, J Kim, J Kim - … of the IEEE conference on computer …, 2017 - openaccess.thecvf.com
Deep convolutional neural networks (DCNNs) have shown remarkable performance in
image classification tasks in recent years. Generally, deep neural network architectures are …

[HTML][HTML] Toward an integration of deep learning and neuroscience

AH Marblestone, G Wayne, KP Kording - Frontiers in computational …, 2016 - frontiersin.org
Neuroscience has focused on the detailed implementation of computation, studying neural
codes, dynamics and circuits. In machine learning, however, artificial neural networks tend …

Few-shot steel surface defect detection

H Wang, Z Li, H Wang - IEEE Transactions on Instrumentation …, 2021 - ieeexplore.ieee.org
Deep learning-based algorithms have been widely employed to build reliable steel surface
defect detection systems, which are important for manufacturing. The performance of deep …