A survey on deep neural network pruning: Taxonomy, comparison, analysis, and recommendations

H Cheng, M Zhang, JQ Shi - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
Modern deep neural networks, particularly recent large language models, come with
massive model sizes that require significant computational and storage resources. To …

A survey on efficient convolutional neural networks and hardware acceleration

D Ghimire, D Kil, S Kim - Electronics, 2022 - mdpi.com
Over the past decade, deep-learning-based representations have demonstrated remarkable
performance in academia and industry. The learning capability of convolutional neural …

Depgraph: Towards any structural pruning

G Fang, X Ma, M Song, MB Mi… - Proceedings of the …, 2023 - openaccess.thecvf.com
Structural pruning enables model acceleration by removing structurally-grouped parameters
from neural networks. However, the parameter-grou** patterns vary widely across …

Structured pruning for deep convolutional neural networks: A survey

Y He, L **ao - IEEE transactions on pattern analysis and …, 2023 - ieeexplore.ieee.org
The remarkable performance of deep Convolutional neural networks (CNNs) is generally
attributed to their deeper and wider architectures, which can come with significant …

Revisiting random channel pruning for neural network compression

Y Li, K Adamczewski, W Li, S Gu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Channel (or 3D filter) pruning serves as an effective way to accelerate the inference of
neural networks. There has been a flurry of algorithms that try to solve this practical problem …

Convolutional neural network pruning with structural redundancy reduction

Z Wang, C Li, X Wang - … of the IEEE/CVF conference on …, 2021 - openaccess.thecvf.com
Convolutional neural network (CNN) pruning has become one of the most successful
network compression approaches in recent years. Existing works on network pruning …

Chip: Channel independence-based pruning for compact neural networks

Y Sui, M Yin, Y **e, H Phan… - Advances in Neural …, 2021 - proceedings.neurips.cc
Filter pruning has been widely used for neural network compression because of its enabled
practical acceleration. To date, most of the existing filter pruning works explore the …

Local relation learning for face forgery detection

S Chen, T Yao, Y Chen, S Ding, J Li, R Ji - Proceedings of the AAAI …, 2021 - ojs.aaai.org
With the rapid development of facial manipulation techniques, face forgery has received
considerable attention in digital media forensics due to security concerns. Most existing …

Group fisher pruning for practical network compression

L Liu, S Zhang, Z Kuang, A Zhou… - International …, 2021 - proceedings.mlr.press
Network compression has been widely studied since it is able to reduce the memory and
computation cost during inference. However, previous methods seldom deal with …

Eagleeye: Fast sub-net evaluation for efficient neural network pruning

B Li, B Wu, J Su, G Wang - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer
Finding out the computational redundant part of a trained Deep Neural Network (DNN) is the
key question that pruning algorithms target on. Many algorithms try to predict model …