A survey on deep neural network pruning: Taxonomy, comparison, analysis, and recommendations

H Cheng, M Zhang, JQ Shi - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
Modern deep neural networks, particularly recent large language models, come with
massive model sizes that require significant computational and storage resources. To …

Deep learning on mobile and embedded devices: State-of-the-art, challenges, and future directions

Y Chen, B Zheng, Z Zhang, Q Wang, C Shen… - ACM Computing …, 2020 - dl.acm.org
Recent years have witnessed an exponential increase in the use of mobile and embedded
devices. With the great success of deep learning in many fields, there is an emerging trend …

Depgraph: Towards any structural pruning

G Fang, X Ma, M Song, MB Mi… - Proceedings of the …, 2023 - openaccess.thecvf.com
Structural pruning enables model acceleration by removing structurally-grouped parameters
from neural networks. However, the parameter-grou** patterns vary widely across …

Repvgg: Making vgg-style convnets great again

X Ding, X Zhang, N Ma, J Han… - Proceedings of the …, 2021 - openaccess.thecvf.com
We present a simple but powerful architecture of convolutional neural network, which has a
VGG-like inference-time body composed of nothing but a stack of 3x3 convolution and …

What is the state of neural network pruning?

D Blalock, JJ Gonzalez Ortiz… - … of machine learning …, 2020 - proceedings.mlsys.org
Neural network pruning---the task of reducing the size of a network by removing parameters--
-has been the subject of a great deal of work in recent years. We provide a meta-analysis of …

Acnet: Strengthening the kernel skeletons for powerful cnn via asymmetric convolution blocks

X Ding, Y Guo, G Ding, J Han - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Abstract As designing appropriate Convolutional Neural Network (CNN) architecture in the
context of a given application usually involves heavy human works or numerous GPU hours …

Efficient tensor decomposition-based filter pruning

Y Zniyed, TP Nguyen - Neural Networks, 2024 - Elsevier
In this paper, we present CORING, which is short for effiCient tensOr decomposition-based
filteR prunING, a novel filter pruning methodology for neural networks. CORING is crafted to …

Resrep: Lossless cnn pruning via decoupling remembering and forgetting

X Ding, T Hao, J Tan, J Liu, J Han… - Proceedings of the …, 2021 - openaccess.thecvf.com
We propose ResRep, a novel method for lossless channel pruning (aka filter pruning), which
slims down a CNN by reducing the width (number of output channels) of convolutional …

Neural pruning via growing regularization

H Wang, C Qin, Y Zhang, Y Fu - arxiv preprint arxiv:2012.09243, 2020 - arxiv.org
Regularization has long been utilized to learn sparsity in deep neural network pruning.
However, its role is mainly explored in the small penalty strength regime. In this work, we …

Evc: Towards real-time neural image compression with mask decay

GH Wang, J Li, B Li, Y Lu - arxiv preprint arxiv:2302.05071, 2023 - arxiv.org
Neural image compression has surpassed state-of-the-art traditional codecs (H. 266/VVC)
for rate-distortion (RD) performance, but suffers from large complexity and separate models …