Transform quantization for CNN compression
In this paper, we compress convolutional neural network (CNN) weights post-training via
transform quantization. Previous CNN quantization techniques tend to ignore the joint …
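A minimal sketch of the idea, assuming an SVD-based decorrelating transform and plain uniform scalar quantization; the transform choice, the bit width, and the fact that the transform itself is kept unquantized are all illustrative simplifications, not the paper's method:

```python
import numpy as np

def transform_quantize(W, bits=4):
    """Decorrelate a weight matrix with an SVD transform, then
    uniformly quantize the transform-domain coefficients."""
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    coeffs = np.diag(s) @ Vt                      # transform-domain coefficients
    lo, step = coeffs.min(), (coeffs.max() - coeffs.min()) / (2**bits - 1)
    q = np.round((coeffs - lo) / step)            # integer codes to store
    return U @ (q * step + lo)                    # dequantize, invert transform

W = np.random.randn(64, 64)
W_hat = transform_quantize(W, bits=4)
print(np.linalg.norm(W - W_hat) / np.linalg.norm(W))  # relative reconstruction error
```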
Optimal gradient compression for distributed and federated learning
Communicating information, like gradient vectors, between computing nodes in distributed
and federated learning is typically an unavoidable burden, resulting in scalability issues …
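For orientation, one widely used gradient compressor is top-k sparsification, which transmits only the k largest-magnitude coordinates; this is a common baseline, not the optimal scheme the paper derives:

```python
import numpy as np

def topk_compress(grad, k):
    """Sender: keep the k largest-magnitude entries; transmit (indices, values)."""
    idx = np.argpartition(np.abs(grad), -k)[-k:]
    return idx, grad[idx]

def topk_decompress(idx, vals, dim):
    """Receiver: scatter the values back into a zero vector."""
    out = np.zeros(dim)
    out[idx] = vals
    return out

g = np.random.randn(1_000_000)
idx, vals = topk_compress(g, k=1000)        # ~1000x fewer coordinates on the wire
g_hat = topk_decompress(idx, vals, g.size)
```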
A gradient flow framework for analyzing network pruning
Recent network pruning methods focus on pruning models early on in training. To estimate
the impact of removing a parameter, these methods use importance measures that were …
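As a point of comparison for such importance measures, a standard first-order (Taylor) score estimates the loss change caused by zeroing each weight; this minimal sketch is illustrative and is not the paper's gradient-flow criterion:

```python
import torch

# Toy model and batch, just to produce gradients.
model = torch.nn.Linear(10, 1)
x, y = torch.randn(32, 10), torch.randn(32, 1)
loss = torch.nn.functional.mse_loss(model(x), y)
loss.backward()

# First-order Taylor importance |w * dL/dw|: the estimated loss change if the
# weight were set to zero. Low-scoring weights are pruning candidates.
scores = {name: (p * p.grad).abs() for name, p in model.named_parameters()}
for name, s in scores.items():
    print(name, s.flatten()[:3])
```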
Finite blocklength lossy source coding for discrete memoryless sources
Shannon propounded a theoretical framework (collectively called information theory) that
uses mathematical tools to understand, model and analyze modern mobile wireless …
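For context, the benchmark object in this area is the minimal rate achievable at blocklength n, distortion level d, and excess-distortion probability epsilon; the well-known Gaussian (dispersion) approximation for discrete memoryless sources is

```latex
R(n, d, \epsilon) \;=\; R(d) \;+\; \sqrt{\frac{V(d)}{n}}\, Q^{-1}(\epsilon) \;+\; O\!\left(\frac{\log n}{n}\right)
```

where R(d) is the asymptotic rate-distortion function, V(d) the rate-dispersion function, and Q^{-1} the inverse Gaussian complementary CDF. The truncated snippet does not state the paper's exact results, so this is offered only as the standard reference point.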
An information-theoretic justification for model pruning
We study the neural network (NN) compression problem, viewing the tension between the
compression ratio and NN performance through the lens of rate-distortion theory. We choose …
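The rate-distortion lens mentioned here refers to the classical trade-off between description rate and expected distortion; stated generically (the distortion measure on the weights is where the paper's specific choices enter):

```latex
R(D) \;=\; \min_{p(\hat w \mid w)\,:\ \mathbb{E}[d(W, \hat W)] \le D} I(W; \hat W)
```

In the pruning setting, the compressed model's size plays the role of the rate R and the drop in network performance plays the role of the distortion D.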
Fundamental limitation of semantic communications: Neural estimation for rate-distortion
This paper studies the fundamental limit of semantic communications over the discrete
memoryless channel. We consider the scenario of sending a semantic source consisting of an …
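A formulation commonly used in this line of work treats the semantic source as an intrinsic state S that is observed only through extrinsic data X, with separate distortion constraints on reconstructing each; this is stated generically, since the truncated snippet does not confirm the paper's exact setting:

```latex
R(D_s, D_x) \;=\; \min_{p(\hat s, \hat x \mid x)\,:\ \mathbb{E}[d_s(S, \hat S)] \le D_s,\ \mathbb{E}[d_x(X, \hat X)] \le D_x} I(X; \hat S, \hat X)
```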
RDO-Q: Extremely fine-grained channel-wise quantization via rate-distortion optimization
Allocating different bit widths to different channels and quantizing them independently brings
higher quantization precision and accuracy. Most prior works use equal bit width to …
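A toy version of channel-wise rate-distortion optimization: give each channel its own bit width by greedily spending a bit budget wherever it reduces quantization error the most. This is a simplification for illustration; the paper's actual RDO formulation and search procedure may differ:

```python
import numpy as np

def quant_error(ch, bits):
    """MSE of uniformly quantizing one channel at a given bit width."""
    lo, hi = ch.min(), ch.max()
    step = (hi - lo) / (2**bits - 1)
    q = np.round((ch - lo) / step) * step + lo
    return np.mean((ch - q) ** 2)

def allocate_bits(channels, total_bits, min_b=2, max_b=8):
    """Greedy allocation: start every channel at min_b bits, then repeatedly
    upgrade the channel whose next bit buys the largest error reduction."""
    bits = [min_b] * len(channels)
    for _ in range(total_bits - sum(bits)):
        gains = [quant_error(c, b) - quant_error(c, b + 1) if b < max_b else -1.0
                 for c, b in zip(channels, bits)]
        i = int(np.argmax(gains))
        if gains[i] <= 0:
            break
        bits[i] += 1
    return bits

channels = [np.random.randn(256) * s for s in (0.1, 1.0, 5.0)]
print(allocate_bits(channels, total_bits=12))  # high-variance channels get more bits
```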
On distributed quantization for classification
We consider the problem of distributed feature quantization, where the goal is to enable a
pretrained classifier at a central node to carry out its classification on features that are …
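The setting the snippet describes can be caricatured in a few lines: remote nodes quantize their features before transmission, and the pretrained classifier at the central node runs unchanged on the quantized features. A plain uniform quantizer stands in here for whatever quantizer the paper actually designs:

```python
import numpy as np

def uniform_quantize(x, bits, lo=-3.0, hi=3.0):
    """Remote node: clip and uniformly quantize features to 2**bits levels."""
    levels = 2**bits - 1
    x = np.clip(x, lo, hi)
    return np.round((x - lo) / (hi - lo) * levels) / levels * (hi - lo) + lo

rng = np.random.default_rng(0)
W, b = rng.standard_normal((16, 2)), rng.standard_normal(2)  # "pretrained" linear classifier
features = rng.standard_normal(16)                           # features at a remote node
logits = uniform_quantize(features, bits=3) @ W + b          # central node classifies
print(logits.argmax())
```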
Population risk improvement with model compression: An information-theoretic approach
It has been reported in many recent works on deep model compression that the population
risk of a compressed model can be even better than that of the original model. In this paper …
Taxonomy and evaluation of structured compression of convolutional neural networks
The success of deep neural networks in many real-world applications is leading to new
challenges in building more efficient architectures. One effective way of making networks …
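Structured compression, as surveyed here, removes whole filters or channels so that the result is a genuinely smaller dense network rather than a sparse one. A minimal sketch of magnitude-based filter pruning for a single convolutional layer, illustrative rather than any specific method from the taxonomy:

```python
import numpy as np

def prune_filters(W, keep_ratio=0.5):
    """W: conv weights shaped (out_channels, in_channels, kh, kw).
    Drop the output filters with the smallest L1 norms."""
    norms = np.abs(W).sum(axis=(1, 2, 3))     # one importance score per filter
    k = max(1, int(len(norms) * keep_ratio))
    keep = np.sort(np.argsort(norms)[-k:])    # indices of the filters we retain
    return W[keep], keep                      # smaller dense tensor + index map

W = np.random.randn(64, 32, 3, 3)
W_small, kept = prune_filters(W, keep_ratio=0.25)
print(W_small.shape)  # (16, 32, 3, 3): a smaller layer that runs on dense kernels
```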