Deep neural network approximation for custom hardware: Where we've been, where we're going

E Wang, JJ Davis, R Zhao, HC Ng, X Niu… - ACM Computing …, 2019 - dl.acm.org
Deep neural networks have proven to be particularly effective in visual and audio
recognition tasks. Existing models tend to be computationally expensive and memory …

Machine learning in resource-scarce embedded systems, FPGAs, and end-devices: A survey

S Branco, AG Ferreira, J Cabral - Electronics, 2019 - mdpi.com
The number of devices connected to the Internet is increasing, exchanging large amounts of
data, and turning the Internet into the 21st-century silk road for data. This road has taken …

Bi-real net: Enhancing the performance of 1-bit cnns with improved representational capability and advanced training algorithm

Z Liu, B Wu, W Luo, X Yang, W Liu… - Proceedings of the …, 2018 - openaccess.thecvf.com
In this work, we study the 1-bit convolutional neural networks (CNNs), of which both the
weights and activations are binary. While being efficient, the classification accuracy of the …

Hello edge: Keyword spotting on microcontrollers

Y Zhang, N Suda, L Lai, V Chandra - arxiv preprint arxiv:1711.07128, 2017 - arxiv.org
Keyword spotting (KWS) is a critical component for enabling speech based user interactions
on smart devices. It requires real-time response and high accuracy for good user …

Cmsis-nn: Efficient neural network kernels for arm cortex-m cpus

L Lai, N Suda, V Chandra - arxiv preprint arxiv:1801.06601, 2018 - arxiv.org
Deep Neural Networks are becoming increasingly popular in always-on IoT edge devices
performing data analytics right at the source, reducing latency as well as energy …

UNPU: An energy-efficient deep neural network accelerator with fully variable weight bit precision

J Lee, C Kim, S Kang, D Shin, S Kim… - IEEE Journal of Solid …, 2018 - ieeexplore.ieee.org
An energy-efficient deep neural network (DNN) accelerator, unified neural processing unit
(UNPU), is proposed for mobile deep learning applications. The UNPU can support both …

Relaxed quantization for discretized neural networks

C Louizos, M Reisser, T Blankevoort, E Gavves… - arxiv preprint arxiv …, 2018 - arxiv.org
Neural network quantization has become an important research area due to its great impact
on deployment of large models on resource constrained devices. In order to train networks …

Zero-centered fixed-point quantization with iterative retraining for deep convolutional neural network-based object detectors

S Kim, H Kim - IEEE Access, 2021 - ieeexplore.ieee.org
In the field of object detection, deep learning has greatly improved accuracy compared to
previous algorithms and has been used widely in recent years. However, object detection …

Effnet: An efficient structure for convolutional neural networks

I Freeman, L Roese-Koerner… - 2018 25th ieee …, 2018 - ieeexplore.ieee.org
With the ever increasing application of Convolutional Neural Networks to customer products
the need emerges for models to efficiently run on embedded, mobile hardware. Slimmer …

The effects of approximate multiplication on convolutional neural networks

MS Kim, AA Del Barrio, H Kim… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
This article analyzes the effects of approximate multiplication when performing inferences on
deep convolutional neural networks (CNNs). The approximate multiplication can reduce the …