- Academic Search

Structured pruning for deep convolutional neural networks: A survey

Y He, L Xiao - IEEE transactions on pattern analysis and …, 2023 - ieeexplore.ieee.org

The remarkable performance of deep convolutional neural networks (CNNs) is generally
attributed to their deeper and wider architectures, which can come with significant …

Cited by 156

Transforming large-size to lightweight deep neural networks for IoT applications

R Mishra, H Gupta - ACM Computing Surveys, 2023 - dl.acm.org

Deep Neural Networks (DNNs) have gained unprecedented popularity due to their high-
order performance and automated feature extraction capability. This has encouraged …

Cited by 42

SPViT: Enabling faster vision transformers via latency-aware soft token pruning

Z Kong, P Dong, X Ma, X Meng, W Niu, M Sun… - European conference on …, 2022 - Springer

Recently, Vision Transformer (ViT) has continuously established new milestones in
the computer vision field, while the high computation and memory cost makes its …

Cited by 197

MEST: Accurate and fast memory-economic sparse training framework on the edge

G Yuan, X Ma, W Niu, Z Li, Z Kong… - Advances in …, 2021 - proceedings.neurips.cc

Recently, a new trend of exploring sparsity for accelerating neural network training has
emerged, embracing the paradigm of training on the edge. This paper proposes a novel …

Cited by 97

CHEX: Channel exploration for CNN model compression

Z Hou, M Qin, F Sun, X Ma, K Yuan… - Proceedings of the …, 2022 - openaccess.thecvf.com

Channel pruning has been broadly recognized as an effective technique to reduce the
computation and memory cost of deep convolutional neural networks. However …

Cited by 93

FORMS: Fine-grained polarized ReRAM-based in-situ computation for mixed-signal DNN accelerator

G Yuan, P Behnam, Z Li, A Shafiee… - 2021 ACM/IEEE 48th …, 2021 - ieeexplore.ieee.org

Recent work demonstrated the promise of using resistive random access memory (ReRAM)
as an emerging technology to perform inherently parallel analog domain in-situ matrix …

Cited by 78

Accelerating federated learning for IoT in big data analytics with pruning, quantization and selective updating

W Xu, W Fang, Y Ding, M Zou, N Xiong - IEEE Access, 2021 - ieeexplore.ieee.org

The ever-increasing number of Internet of Things (IoT) devices are continuously generating
huge masses of data, but the current cloud-centric approach for IoT big data analysis has …

Cited by 106

Hardware acceleration of sparse and irregular tensor computations of ML models: A survey and insights

S Dave, R Baghdadi, T Nowatzki… - Proceedings of the …, 2021 - ieeexplore.ieee.org

Machine learning (ML) models are widely used in many important domains. For efficiently
processing these computational- and memory-intensive applications, tensors of these …

Cited by 101

A comprehensive review of model compression techniques in machine learning

PV Dantas, W Sabino da Silva Jr, LC Cordeiro… - Applied …, 2024 - Springer

This paper critically examines model compression techniques within the machine learning
(ML) domain, emphasizing their role in enhancing model efficiency for deployment in …

Cited by 15

Teachers do more than teach: Compressing image-to-image models

Q Jin, J Ren, OJ Woodford, J Wang… - Proceedings of the …, 2021 - openaccess.thecvf.com

Generative Adversarial Networks (GANs) have achieved huge success in
generating high-fidelity images, however, they suffer from low efficiency due to tremendous …

Cited by 70

Non-structured DNN weight pruning—Is it beneficial in any platform?

Structured pruning for deep convolutional neural networks: A survey
