- Academic Search

HI Liu, M Galindo, H **e, LK Wong, HH Shuai… - ACM Computing …, 2024 - dl.acm.org

Over the past decade, the dominance of deep learning has prevailed across various
domains of artificial intelligence, including natural language processing, computer vision …

Simpan Kutip Dirujuk 32 kali Artikel terkait 3 versi

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision

X Luo, D Liu, H Kong, S Huai, H Chen… - ACM Transactions on …, 2024 - dl.acm.org

Deep neural networks (DNNs) have recently achieved impressive success across a wide
range of real-world vision and language processing tasks, spanning from image …

Simpan Kutip Dirujuk 1 kali Artikel terkait 4 versi

A 95.6-TOPS/W deep learning inference accelerator with per-vector scaled 4-bit quantization in 5 nm

B Keller, R Venkatesan, S Dai, SG Tell… - IEEE Journal of Solid …, 2023 - ieeexplore.ieee.org

The energy efficiency of deep neural network (DNN) inference can be improved with custom
accelerators. DNN inference accelerators often employ specialized hardware techniques to …

Simpan Kutip Dirujuk 29 kali Artikel terkait 2 versi

Demystifying bert: System design implications

S Pati, S Aga, N Jayasena… - 2022 IEEE International …, 2022 - ieeexplore.ieee.org

Transfer learning in natural language processing (NLP) uses increasingly large models that
tackle challenging problems. Consequently, these applications are driving the requirements …

Simpan Kutip Dirujuk 29 kali Artikel terkait 2 versi

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

NIPQ: Noise proxy-based integrated pseudo-quantization

J Shin, J So, S Park, S Kang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract Straight-through estimator (STE), which enables the gradient flow over the non-
differentiable function via approximation, has been favored in studies related to quantization …

Simpan Kutip Dirujuk 23 kali Artikel terkait 5 versi Versi HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Unit scaling: Out-of-the-box low-precision training

C Blake, D Orr, C Luschi - International Conference on …, 2023 - proceedings.mlr.press

We present unit scaling, a paradigm for designing deep learning models that simplifies the
use of low-precision number formats. Training in FP16 or the recently proposed FP8 formats …

Simpan Kutip Dirujuk 6 kali Artikel terkait 6 versi Versi HTML

[Free GPT-4]
[DeepSeek]

[HTML] mdpi.com

[HTML][HTML] Assessing the influence of sensor-induced noise on machine-learning-based changeover detection in CNC machines

VG Biju, AM Schmitt, B Engelmann - Sensors, 2024 - mdpi.com

The noise in sensor data has a substantial impact on the reliability and accuracy of (ML)
algorithms. A comprehensive framework is proposed to analyze the effects of diverse noise …

Simpan Kutip Dirujuk 6 kali Artikel terkait 8 versi Cache

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

2-bit conformer quantization for automatic speech recognition

O Rybakov, P Meadowlark, S Ding, D Qiu, J Li… - arxiv preprint arxiv …, 2023 - arxiv.org

Large speech models are rapidly gaining traction in research community. As a result, model
compression has become an important topic, so that these models can fit in memory and be …

Simpan Kutip Dirujuk 10 kali Artikel terkait 4 versi Versi HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Powerquant: Automorphism search for non-uniform quantization

E Yvinec, A Dapogny, M Cord, K Bailly - arxiv preprint arxiv:2301.09858, 2023 - arxiv.org

Deep neural networks (DNNs) are nowadays ubiquitous in many domains such as computer
vision. However, due to their high latency, the deployment of DNNs hinges on the …

Simpan Kutip Dirujuk 17 kali Artikel terkait 5 versi Versi HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Bitdistiller: Unleashing the potential of sub-4-bit llms via self-distillation

D Du, Y Zhang, S Cao, J Guo, T Cao, X Chu… - arxiv preprint arxiv …, 2024 - arxiv.org

The upscaling of Large Language Models (LLMs) has yielded impressive advances in
natural language processing, yet it also poses significant deployment challenges. Weight …

Simpan Kutip Dirujuk 14 kali Artikel terkait 2 versi Versi HTML

Buat notifikasi

Kutip

Penelusuran lanjutan

Disimpan ke Koleksi saya

Optimal clip** and magnitude-aware differentiation for improved quantization-aware training

Lightweight Deep Learning for Resource-Constrained Environments: A Survey

Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision

A 95.6-TOPS/W deep learning inference accelerator with per-vector scaled 4-bit quantization in 5 nm

Demystifying bert: System design implications

NIPQ: Noise proxy-based integrated pseudo-quantization

Unit scaling: Out-of-the-box low-precision training

[HTML][HTML] Assessing the influence of sensor-induced noise on machine-learning-based changeover detection in CNC machines

2-bit conformer quantization for automatic speech recognition

Powerquant: Automorphism search for non-uniform quantization

Bitdistiller: Unleashing the potential of sub-4-bit llms via self-distillation