Structured pruning for deep convolutional neural networks: A survey
The remarkable performance of deep convolutional neural networks (CNNs) is generally
attributed to their deeper and wider architectures, which can come with significant …
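
To make the survey topic concrete, here is a minimal sketch (my own illustration, not code from the paper) of one common structured-pruning criterion: rank a convolutional layer's filters by L1 norm and drop the weakest ones, so entire output channels are removed and the remaining layer stays dense.

```python
import numpy as np

rng = np.random.default_rng(0)
weights = rng.normal(size=(64, 3, 3, 3))   # (out_channels, in_channels, kH, kW)
keep_ratio = 0.5                           # keep the strongest half of the filters

# L1 norm of each filter, one score per output channel.
filter_norms = np.abs(weights).reshape(weights.shape[0], -1).sum(axis=1)
n_keep = int(weights.shape[0] * keep_ratio)
keep_idx = np.sort(np.argsort(filter_norms)[-n_keep:])

# Structured pruning yields a smaller dense tensor; no sparse masks are needed.
pruned = weights[keep_idx]
print(weights.shape, "->", pruned.shape)   # (64, 3, 3, 3) -> (32, 3, 3, 3)
```
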
Artificial neural networks for photonic applications—from algorithms to implementation: tutorial
This tutorial–review on applications of artificial neural networks in photonics targets a broad
audience, ranging from optical research and engineering communities to computer science …
A white paper on neural network quantization
While neural networks have advanced the frontiers in many applications, they often come at
a high computational cost. Reducing the power and latency of neural network inference is …
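
As a rough illustration of the kind of method such work covers, the sketch below shows asymmetric uniform quantization with a scale and zero-point (my own toy example; the function names and 8-bit setting are illustrative, not taken from the paper).

```python
import numpy as np

def quantize(x, num_bits=8):
    """Map a float tensor to unsigned integers via a scale and zero-point."""
    qmin, qmax = 0, 2 ** num_bits - 1
    scale = (x.max() - x.min()) / (qmax - qmin)
    zero_point = int(round(qmin - x.min() / scale))
    q = np.clip(np.round(x / scale) + zero_point, qmin, qmax).astype(np.uint8)
    return q, scale, zero_point

def dequantize(q, scale, zero_point):
    """Recover approximate float values from the integer representation."""
    return scale * (q.astype(np.float32) - zero_point)

w = np.random.default_rng(0).normal(size=(4, 4)).astype(np.float32)
q, s, z = quantize(w)
print("max round-trip error:", float(np.abs(dequantize(q, s, z) - w).max()))
```
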
A survey of quantization methods for efficient neural network inference
This chapter provides approaches to the problem of quantizing the numerical values in deep
neural network computations, covering the advantages/disadvantages of current methods …
Sparsity in deep learning: Pruning and growth for efficient inference and training in neural networks
The growing energy and performance costs of deep learning have driven the community to
reduce the size of neural networks by selectively pruning components. Similarly to their …
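
For contrast with the structured example above, a minimal sketch of unstructured magnitude pruning (my own illustration, not code from the survey): zero out the smallest-magnitude weights to hit a target sparsity, leaving the tensor shape unchanged so sparse kernels or masked retraining can exploit the zeros.

```python
import numpy as np

rng = np.random.default_rng(0)
w = rng.normal(size=(256, 256))
sparsity = 0.9                                # fraction of weights to remove

threshold = np.quantile(np.abs(w), sparsity)  # magnitude cutoff for the mask
mask = np.abs(w) >= threshold
w_pruned = w * mask

print("actual sparsity:", 1.0 - mask.mean())  # close to 0.9
```
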
Pruning vs quantization: Which is better?
Neural network pruning and quantization techniques are almost as old as neural networks
themselves. However, to date, only ad-hoc comparisons between the two have been …
COIN: Compression with implicit neural representations
We propose a new simple approach for image compression: instead of storing the RGB
values for each pixel of an image, we store the weights of a neural network overfitted to the …
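
The idea is simple enough to sketch end to end. Below is a toy version of the approach (my own illustration, assuming a tiny one-hidden-layer tanh MLP and plain gradient descent in place of the SIREN network and optimizer used in the paper): fit the network to map pixel coordinates to intensities, then treat the learned weights as the compressed representation and decode by re-evaluating the network on the pixel grid.

```python
import numpy as np

rng = np.random.default_rng(0)
H, W = 16, 16
# Stand-in grayscale image: a smooth 2-D pattern the small MLP can fit.
image = np.sin(np.linspace(0, 3, H))[:, None] * np.cos(np.linspace(0, 3, W))[None, :]

# Inputs are normalized (y, x) coordinates; targets are pixel intensities.
ys, xs = np.meshgrid(np.arange(H), np.arange(W), indexing="ij")
coords = np.stack([ys.ravel() / (H - 1), xs.ravel() / (W - 1)], axis=1)
targets = image.ravel()[:, None]

hidden = 32
W1 = rng.normal(0, 0.5, (2, hidden)); b1 = np.zeros(hidden)
W2 = rng.normal(0, 0.5, (hidden, 1)); b2 = np.zeros(1)

lr = 0.3
for step in range(3000):
    h = np.tanh(coords @ W1 + b1)                 # forward pass
    pred = h @ W2 + b2
    err = pred - targets                          # MSE error
    gW2 = h.T @ err / len(err); gb2 = err.mean(0) # backprop, output layer
    dh = (err @ W2.T) * (1 - h ** 2)              # backprop, hidden layer
    gW1 = coords.T @ dh / len(dh); gb1 = dh.mean(0)
    W1 -= lr * gW1; b1 -= lr * gb1; W2 -= lr * gW2; b2 -= lr * gb2

# "Compressed" image = the weights; decoding = re-evaluating the MLP on the grid.
decoded = (np.tanh(coords @ W1 + b1) @ W2 + b2).reshape(H, W)
print("reconstruction MSE:", float(((decoded - image) ** 2).mean()))
```
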
HAWQ-V3: Dyadic neural network quantization
Current low-precision quantization algorithms often have the hidden cost of conversion back
and forth from floating point to quantized integer values. This hidden cost limits the latency …
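
The "dyadic" idea can be illustrated with a small sketch (my own toy example, not code from the paper): the real-valued rescale factor applied to an integer accumulator is approximated by b / 2**c, so requantization becomes an integer multiply followed by a bit shift, with no trip back through floating point.

```python
import numpy as np

def dyadic_approx(scale, shift_bits=16):
    """Approximate a float scale by b / 2**shift_bits with integer b (illustrative helper)."""
    b = int(round(scale * (1 << shift_bits)))
    return b, shift_bits

# int32-style accumulator values from an integer matmul (random stand-ins here).
acc = np.random.default_rng(0).integers(-2**20, 2**20, size=8, dtype=np.int64)
real_scale = 0.0123            # e.g. s_input * s_weight / s_output in a quantized layer

b, c = dyadic_approx(real_scale)
requant_int = (acc * b) >> c                               # integer-only path
requant_ref = np.round(acc * real_scale).astype(np.int64)  # floating-point reference
print("max difference vs float rescale:", int(np.abs(requant_int - requant_ref).max()))
```
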
Understanding and overcoming the challenges of efficient transformer quantization
Transformer-based architectures have become the de-facto standard models for a wide
range of Natural Language Processing tasks. However, their memory footprint and high …
Only train once: A one-shot neural network training and pruning framework
Structured pruning is a commonly used technique in deploying deep neural networks
(DNNs) onto resource-constrained devices. However, the existing pruning methods are …