A survey on deep neural network pruning: Taxonomy, comparison, analysis, and recommendations

H Cheng, M Zhang, JQ Shi - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
Modern deep neural networks, particularly recent large language models, come with
massive model sizes that require significant computational and storage resources. To …

Efficient acceleration of deep learning inference on resource-constrained edge devices: A review

MMH Shuvo, SK Islam, J Cheng… - Proceedings of the …, 2022 - ieeexplore.ieee.org
Successful integration of deep neural networks (DNNs) or deep learning (DL) has resulted
in breakthroughs in many areas. However, deploying these highly accurate models for data …

Sparsity in deep learning: Pruning and growth for efficient inference and training in neural networks

T Hoefler, D Alistarh, T Ben-Nun, N Dryden… - Journal of Machine …, 2021 - jmlr.org
The growing energy and performance costs of deep learning have driven the community to
reduce the size of neural networks by selectively pruning components. Similarly to their …

Pruning and quantization for deep neural network acceleration: A survey

T Liang, J Glossner, L Wang, S Shi, X Zhang - Neurocomputing, 2021 - Elsevier
Deep neural networks have been applied in many applications exhibiting extraordinary
abilities in the field of computer vision. However, complex network architectures challenge …

AutoML: A survey of the state-of-the-art

X He, K Zhao, X Chu - Knowledge-based systems, 2021 - Elsevier
Deep learning (DL) techniques have obtained remarkable achievements on various tasks,
such as image recognition, object detection, and language modeling. However, building a …

On the efficacy of knowledge distillation

JH Cho, B Hariharan - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
In this paper, we present a thorough evaluation of the efficacy of knowledge distillation and
its dependence on student and teacher architectures. Starting with the observation that more …

Green ai

R Schwartz, J Dodge, NA Smith, O Etzioni - Communications of the ACM, 2020 - dl.acm.org
Green AI Page 1 54 COMMUNICATIONS OF THE ACM | DECEMBER 2020 | VOL. 63 | NO.
12 contributed articles ILL US TRA TION B Y LIS A SHEEHAN DOI:10.1145/3381831 …

Importance estimation for neural network pruning

P Molchanov, A Mallya, S Tyree… - Proceedings of the …, 2019 - openaccess.thecvf.com
Structural pruning of neural network parameters reduces computational, energy, and
memory transfer costs during inference. We propose a novel method that estimates the …

Edge intelligence: Empowering intelligence to the edge of network

D Xu, T Li, Y Li, X Su, S Tarkoma, T Jiang… - Proceedings of the …, 2021 - ieeexplore.ieee.org
Edge intelligence refers to a set of connected systems and devices for data collection,
caching, processing, and analysis proximity to where data are captured based on artificial …

Mixhop: Higher-order graph convolutional architectures via sparsified neighborhood mixing

S Abu-El-Haija, B Perozzi, A Kapoor… - international …, 2019 - proceedings.mlr.press
Existing popular methods for semi-supervised learning with Graph Neural Networks (such
as the Graph Convolutional Network) provably cannot learn a general class of …