Efficient acceleration of deep learning inference on resource-constrained edge devices: A review
Successful integration of deep neural networks (DNNs) or deep learning (DL) has resulted
in breakthroughs in many areas. However, deploying these highly accurate models for data …
in breakthroughs in many areas. However, deploying these highly accurate models for data …
Model compression for deep neural networks: A survey
Currently, with the rapid development of deep learning, deep neural networks (DNNs) have
been widely applied in various computer vision tasks. However, in the pursuit of …
been widely applied in various computer vision tasks. However, in the pursuit of …
YOLOv4-5D: An effective and efficient object detector for autonomous driving
Y Cai, T Luan, H Gao, H Wang, L Chen… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
The use of object detection algorithms has become extremely important in autonomous
vehicles. Object detection at high accuracy and a fast inference speed is essential for safe …
vehicles. Object detection at high accuracy and a fast inference speed is essential for safe …
AutoML: A survey of the state-of-the-art
Deep learning (DL) techniques have obtained remarkable achievements on various tasks,
such as image recognition, object detection, and language modeling. However, building a …
such as image recognition, object detection, and language modeling. However, building a …
Filter pruning via geometric median for deep convolutional neural networks acceleration
Previous works utilized" smaller-norm-less-important" criterion to prune filters with smaller
norm values in a convolutional neural network. In this paper, we analyze this norm-based …
norm values in a convolutional neural network. In this paper, we analyze this norm-based …
Learning filter pruning criteria for deep convolutional neural networks acceleration
Filter pruning has been widely applied to neural network compression and acceleration.
Existing methods usually utilize pre-defined pruning criteria, such as Lp-norm, to prune …
Existing methods usually utilize pre-defined pruning criteria, such as Lp-norm, to prune …
Dynamic channel pruning: Feature boosting and suppression
Making deep convolutional neural networks more accurate typically comes at the cost of
increased computational and memory resources. In this paper, we reduce this cost by …
increased computational and memory resources. In this paper, we reduce this cost by …
Scaling for edge inference of deep neural networks
Deep neural networks offer considerable potential across a range of applications, from
advanced manufacturing to autonomous cars. A clear trend in deep neural networks is the …
advanced manufacturing to autonomous cars. A clear trend in deep neural networks is the …
Methods for pruning deep neural networks
This paper presents a survey of methods for pruning deep neural networks. It begins by
categorising over 150 studies based on the underlying approach used and then focuses on …
categorising over 150 studies based on the underlying approach used and then focuses on …
Autoslim: Towards one-shot architecture search for channel numbers
We study how to set channel numbers in a neural network to achieve better accuracy under
constrained resources (eg, FLOPs, latency, memory footprint or model size). A simple and …
constrained resources (eg, FLOPs, latency, memory footprint or model size). A simple and …