Efficient acceleration of deep learning inference on resource-constrained edge devices: A review

MMH Shuvo, SK Islam, J Cheng… - Proceedings of the …, 2022 - ieeexplore.ieee.org
Successful integration of deep neural networks (DNNs) or deep learning (DL) has resulted
in breakthroughs in many areas. However, deploying these highly accurate models for data …

Model compression for deep neural networks: A survey

Z Li, H Li, L Meng - Computers, 2023 - mdpi.com
Currently, with the rapid development of deep learning, deep neural networks (DNNs) have
been widely applied in various computer vision tasks. However, in the pursuit of …

YOLOv4-5D: An effective and efficient object detector for autonomous driving

Y Cai, T Luan, H Gao, H Wang, L Chen… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
The use of object detection algorithms has become extremely important in autonomous
vehicles. Object detection at high accuracy and a fast inference speed is essential for safe …

AutoML: A survey of the state-of-the-art

X He, K Zhao, X Chu - Knowledge-based systems, 2021 - Elsevier
Deep learning (DL) techniques have obtained remarkable achievements on various tasks,
such as image recognition, object detection, and language modeling. However, building a …

Filter pruning via geometric median for deep convolutional neural networks acceleration

Y He, P Liu, Z Wang, Z Hu… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Previous works utilized" smaller-norm-less-important" criterion to prune filters with smaller
norm values in a convolutional neural network. In this paper, we analyze this norm-based …

Learning filter pruning criteria for deep convolutional neural networks acceleration

Y He, Y Ding, P Liu, L Zhu… - Proceedings of the …, 2020 - openaccess.thecvf.com
Filter pruning has been widely applied to neural network compression and acceleration.
Existing methods usually utilize pre-defined pruning criteria, such as Lp-norm, to prune …

Dynamic channel pruning: Feature boosting and suppression

X Gao, Y Zhao, Ł Dudziak, R Mullins, C Xu - arxiv preprint arxiv …, 2018 - arxiv.org
Making deep convolutional neural networks more accurate typically comes at the cost of
increased computational and memory resources. In this paper, we reduce this cost by …

Scaling for edge inference of deep neural networks

X Xu, Y Ding, SX Hu, M Niemier, J Cong, Y Hu… - Nature Electronics, 2018 - nature.com
Deep neural networks offer considerable potential across a range of applications, from
advanced manufacturing to autonomous cars. A clear trend in deep neural networks is the …

Methods for pruning deep neural networks

S Vadera, S Ameen - IEEE Access, 2022 - ieeexplore.ieee.org
This paper presents a survey of methods for pruning deep neural networks. It begins by
categorising over 150 studies based on the underlying approach used and then focuses on …

Autoslim: Towards one-shot architecture search for channel numbers

J Yu, T Huang - arxiv preprint arxiv:1903.11728, 2019 - arxiv.org
We study how to set channel numbers in a neural network to achieve better accuracy under
constrained resources (eg, FLOPs, latency, memory footprint or model size). A simple and …