A survey of techniques for optimizing transformer inference

KT Chitty-Venkata, S Mittal, M Emani… - Journal of Systems …, 2023 - Elsevier
Recent years have seen a phenomenal rise in the performance and applications of
transformer neural networks. The family of transformer networks, including Bidirectional …

A comprehensive review of binary neural network

C Yuan, SS Agaian - Artificial Intelligence Review, 2023 - Springer
Deep learning (DL) has recently changed the development of intelligent systems and is
widely adopted in many real-life applications. Despite their various benefits and potentials …

LLM-QAT: Data-free quantization aware training for large language models

Z Liu, B Oguz, C Zhao, E Chang, P Stock… - arXiv preprint arXiv …, 2023 - arxiv.org
Several post-training quantization methods have been applied to large language models
(LLMs), and have been shown to perform well down to 8 bits. We find that these methods …
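
The snippet above mentions post-training quantization performing well down to 8 bits. As a point of reference only, the following is a minimal NumPy sketch of generic symmetric per-tensor int8 post-training quantization; it is not LLM-QAT's data-free quantization-aware training, and the function names and per-tensor granularity are illustrative assumptions.

```python
import numpy as np

def quantize_int8(w: np.ndarray):
    """Symmetric per-tensor 8-bit post-training quantization (illustrative sketch)."""
    # map the largest-magnitude weight onto the int8 range [-127, 127]
    scale = np.max(np.abs(w)) / 127.0
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover an approximate float32 weight matrix from int8 values and the scale."""
    return q.astype(np.float32) * scale

# usage: the round-trip error stays small relative to the weight magnitudes
w = np.random.randn(256, 256).astype(np.float32)
q, s = quantize_int8(w)
err = np.abs(dequantize(q, s) - w).max()
```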

PB-LLM: Partially binarized large language models

Y Shang, Z Yuan, Q Wu, Z Dong - arXiv preprint arXiv:2310.00034, 2023 - arxiv.org
This paper explores network binarization, a radical form of quantization that compresses model
weights to a single bit, specifically for compressing Large Language Models (LLMs). Due to …
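
Since several entries in this listing concern network binarization, a brief illustration may help: binarization replaces full-precision weights with values in {-1, +1} plus a scaling factor. The NumPy sketch below shows generic sign-based weight binarization with a per-row scale (the mean absolute value, in the style of XNOR-Net-like schemes); it is not PB-LLM's partially binarized method, and the shapes and names are illustrative assumptions.

```python
import numpy as np

def binarize_weights(w: np.ndarray):
    """Generic sign-based weight binarization (illustrative sketch).

    w: 2-D weight matrix of shape (out_features, in_features).
    Returns binary weights in {-1, +1} and a per-row scale alpha that
    minimizes the L2 error ||w - alpha * sign(w)||.
    """
    # sign(0) is mapped to +1 so every weight is representable in 1 bit
    w_bin = np.where(w >= 0, 1.0, -1.0)
    # the optimal per-output-channel scale is the mean absolute value of the row
    alpha = np.mean(np.abs(w), axis=1, keepdims=True)
    return w_bin, alpha

# usage: approximate a float matmul with 1-bit weights and float activations
w = np.random.randn(4, 8).astype(np.float32)
x = np.random.randn(8).astype(np.float32)
w_bin, alpha = binarize_weights(w)
y_full = w @ x
y_bin = (alpha * w_bin) @ x
```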

BiBench: Benchmarking and analyzing network binarization

H Qin, M Zhang, Y Ding, A Li, Z Cai… - International …, 2023 - proceedings.mlr.press
Network binarization emerges as one of the most promising compression approaches
offering extraordinary computation and memory savings by minimizing the bit-width …

BiViT: Extremely compressed binary vision transformers

Y He, Z Lou, L Zhang, J Liu, W Wu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Model binarization can significantly compress model size, reduce energy
consumption, and accelerate inference through efficient bit-wise operations. Although …
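
The "efficient bit-wise operations" mentioned above typically refer to replacing multiply-accumulate with XNOR and popcount once both operands are binary. A minimal sketch follows, using plain Python integers as bit containers; it illustrates the general idea only, not any particular paper's kernel.

```python
import numpy as np

def pack_signs(v: np.ndarray) -> int:
    """Pack a {-1, +1} vector into the bits of a Python int (1 for +1, 0 for -1)."""
    bits = 0
    for i, s in enumerate(v):
        if s > 0:
            bits |= 1 << i
    return bits

def xnor_popcount_dot(a_bits: int, b_bits: int, n: int) -> int:
    """Dot product of two {-1, +1} vectors of length n via XNOR + popcount.

    Matching bits contribute +1 and mismatching bits contribute -1,
    so dot = n - 2 * popcount(a XOR b).
    """
    mismatches = bin((a_bits ^ b_bits) & ((1 << n) - 1)).count("1")
    return n - 2 * mismatches

# usage: agrees with the ordinary dot product on sign vectors
a = np.random.choice([-1, 1], size=16)
b = np.random.choice([-1, 1], size=16)
assert xnor_popcount_dot(pack_signs(a), pack_signs(b), 16) == int(a @ b)
```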

BinaryViT: Pushing binary vision transformers towards convolutional models

PHC Le, X Li - Proceedings of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
With the increasing popularity and growing size of vision transformers (ViTs), there
has been rising interest in making them more efficient and less computationally …

Scalable MatMul-free language modeling

RJ Zhu, Y Zhang, E Sifferman, T Sheaves… - arXiv preprint arXiv …, 2024 - openreview.net
Matrix multiplication (MatMul) typically dominates the overall computational cost of large
language models (LLMs). This cost only grows as LLMs scale to larger embedding …

DB-LLM: Accurate dual-binarization for efficient LLMs

H Chen, C Lv, L Ding, H Qin, X Zhou, Y Ding… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) have significantly advanced the field of natural language
processing, while their expensive memory and computation consumption impedes their …

ShiftAddViT: Mixture of multiplication primitives towards efficient vision transformer

H You, H Shi, Y Guo, Y Lin - Advances in Neural …, 2023 - proceedings.neurips.cc
Vision Transformers (ViTs) have shown impressive performance and have become
a unified backbone for multiple vision tasks. However, both the attention mechanism and …