A review of green artificial intelligence: Towards a more sustainable future

V Bolón-Canedo, L Morán-Fernández, B Cancela… - Neurocomputing, 2024 - Elsevier
Green artificial intelligence (AI) is more environmentally friendly and inclusive than
conventional AI, as it not only produces accurate results without increasing the …

A survey of techniques for optimizing transformer inference

KT Chitty-Venkata, S Mittal, M Emani… - Journal of Systems …, 2023 - Elsevier
Recent years have seen a phenomenal rise in the performance and applications of
transformer neural networks. The family of transformer networks, including Bidirectional …

SmoothQuant: Accurate and efficient post-training quantization for large language models

G Xiao, J Lin, M Seznec, H Wu… - International …, 2023 - proceedings.mlr.press
Large language models (LLMs) show excellent performance but are compute- and memory-
intensive. Quantization can reduce memory and accelerate inference. However, existing …

Rethinking vision transformers for MobileNet size and speed

Y Li, J Hu, Y Wen, G Evangelidis… - Proceedings of the …, 2023 - openaccess.thecvf.com
With the success of Vision Transformers (ViTs) in computer vision tasks, recent works try to
optimize the performance and complexity of ViTs to enable efficient deployment on mobile …

QuIP: 2-bit quantization of large language models with guarantees

J Chee, Y Cai, V Kuleshov… - Advances in Neural …, 2023 - proceedings.neurips.cc
This work studies post-training parameter quantization in large language models (LLMs).
We introduce quantization with incoherence processing (QuIP), a new method based on the …

ZeroQuant: Efficient and affordable post-training quantization for large-scale transformers

Z Yao, R Yazdani Aminabadi… - Advances in …, 2022 - proceedings.neurips.cc
How to efficiently serve ever-larger trained natural language models in practice has become
exceptionally challenging even for powerful cloud servers due to their prohibitive …

TinyViT: Fast pretraining distillation for small vision transformers

K Wu, J Zhang, H Peng, M Liu, B Xiao, J Fu… - European Conference on …, 2022 - Springer
Vision transformer (ViT) has recently drawn great attention in computer vision due to its
remarkable model capability. However, most prevailing ViT models suffer from a huge number …

ShortGPT: Layers in large language models are more redundant than you expect

X Men, M Xu, Q Zhang, B Wang, H Lin, Y Lu… - arXiv preprint arXiv …, 2024 - arxiv.org
As Large Language Models (LLMs) continue to advance in performance, their size has
escalated significantly, with current LLMs containing billions or even trillions of parameters …

Aging with GRACE: Lifelong model editing with discrete key-value adaptors

T Hartvigsen, S Sankaranarayanan… - Advances in …, 2023 - proceedings.neurips.cc
Deployed language models decay over time due to shifting inputs, changing user needs, or
emergent world-knowledge gaps. When such problems are identified, we want to make …

A survey on vision transformer

K Han, Y Wang, H Chen, X Chen, J Guo… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Transformer, first applied to the field of natural language processing, is a type of deep neural
network mainly based on the self-attention mechanism. Thanks to its strong representation …