Google Наука

Y He, L **ao - IEEE transactions on pattern analysis and …, 2023 - ieeexplore.ieee.org

The remarkable performance of deep Convolutional neural networks (CNNs) is generally
attributed to their deeper and wider architectures, which can come with significant …

Запазване Позоваване С позовавания в 168 Сродни статии Всички 11 версии

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A comprehensive survey on model quantization for deep neural networks in image classification

B Rokh, A Azarpeyvand, A Khanteymoori - ACM Transactions on …, 2023 - dl.acm.org

Recent advancements in machine learning achieved by Deep Neural Networks (DNNs)
have been significant. While demonstrating high accuracy, DNNs are associated with a …

Запазване Позоваване С позовавания в 97 Сродни статии Всички 4 версии

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Squeezellm: Dense-and-sparse quantization

S Kim, C Hooper, A Gholami, Z Dong, X Li… - arxiv preprint arxiv …, 2023 - arxiv.org

Generative Large Language Models (LLMs) have demonstrated remarkable results for a
wide range of tasks. However, deploying these models for inference has been a significant …

Запазване Позоваване С позовавания в 186 Сродни статии Всички 8 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A survey of quantization methods for efficient neural network inference

A Gholami, S Kim, Z Dong, Z Yao… - Low-power computer …, 2022 - taylorfrancis.com

This chapter provides approaches to the problem of quantizing the numerical values in deep
Neural Network computations, covering the advantages/disadvantages of current methods …

Запазване Позоваване С позовавания в 1409 Сродни статии Всички 5 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Full stack optimization of transformer inference: a survey

S Kim, C Hooper, T Wattanawong, M Kang… - arxiv preprint arxiv …, 2023 - arxiv.org

Recent advances in state-of-the-art DNN architecture design have been moving toward
Transformer models. These models achieve superior accuracy across a wide range of …

Запазване Позоваване С позовавания в 99 Сродни статии Всички 4 версии Във вид на HTML

LungNet: A hybrid deep-CNN model for lung cancer diagnosis using CT and wearable sensor-based medical IoT data

N Faruqui, MA Yousuf, M Whaiduzzaman… - Computers in Biology …, 2021 - Elsevier

Lung cancer, also known as pulmonary cancer, is one of the deadliest cancers, but yet
curable if detected at the early stage. At present, the ambiguous features of the lung cancer …

Запазване Позоваване С позовавания в 159 Сродни статии Всички 9 версии

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

The optimal bert surgeon: Scalable and accurate second-order pruning for large language models

E Kurtic, D Campos, T Nguyen, E Frantar… - arxiv preprint arxiv …, 2022 - arxiv.org

Transformer-based language models have become a key building block for natural
language processing. While these models are extremely accurate, they can be too large and …

Запазване Позоваване С позовавания в 134 Сродни статии Всички 4 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Squant: On-the-fly data-free quantization via diagonal hessian approximation

C Guo, Y Qiu, J Leng, X Gao, C Zhang, Y Liu… - arxiv preprint arxiv …, 2022 - arxiv.org

Quantization of deep neural networks (DNN) has been proven effective for compressing and
accelerating DNN models. Data-free quantization (DFQ) is a promising approach without the …

Запазване Позоваване С позовавания в 73 Сродни статии Всички 6 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] frontiersin.org

Applications and techniques for fast machine learning in science

AMC Deiana, N Tran, J Agar, M Blott… - Frontiers in big …, 2022 - frontiersin.org

In this community review report, we discuss applications and techniques for fast machine
learning (ML) in science—the concept of integrating powerful ML methods into the real-time …

Запазване Позоваване С позовавания в 68 Сродни статии Всички 28 версии Търсене на библиотеки Кеширана версия

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Global vision transformer pruning with hessian-aware saliency

H Yang, H Yin, M Shen, P Molchanov… - Proceedings of the …, 2023 - openaccess.thecvf.com

Transformers yield state-of-the-art results across many tasks. However, their heuristically
designed architecture impose huge computational costs during inference. This work aims on …

Запазване Позоваване С позовавания в 52 Сродни статии Всички 9 версии Търсене на библиотеки Във вид на HTML

Създаване на сигнал

Позоваване

Разширено търсене

Запазено в „Моята библиотека“

Hessian-aware pruning and optimal neural implant

Structured pruning for deep convolutional neural networks: A survey

A comprehensive survey on model quantization for deep neural networks in image classification

Squeezellm: Dense-and-sparse quantization

A survey of quantization methods for efficient neural network inference

Full stack optimization of transformer inference: a survey

LungNet: A hybrid deep-CNN model for lung cancer diagnosis using CT and wearable sensor-based medical IoT data

The optimal bert surgeon: Scalable and accurate second-order pruning for large language models

Squant: On-the-fly data-free quantization via diagonal hessian approximation

Applications and techniques for fast machine learning in science

Global vision transformer pruning with hessian-aware saliency