A survey of techniques for optimizing transformer inference
Recent years have seen a phenomenal rise in the performance and applications of
transformer neural networks. The family of transformer networks, including Bidirectional …
A survey on neural speech synthesis
Text-to-speech (TTS), or speech synthesis, which aims to synthesize intelligible and natural
speech given text, is a hot research topic in speech, language, and machine learning …
Meta learning for natural language processing: A survey
Deep learning has been the mainstream technique in the natural language processing (NLP)
area. However, these techniques require large amounts of labeled data and are less generalizable across …
SqueezeLLM: Dense-and-sparse quantization
Generative Large Language Models (LLMs) have demonstrated remarkable results for a
wide range of tasks. However, deploying these models for inference has been a significant …
A fast post-training pruning framework for transformers
Pruning is an effective way to reduce the huge inference cost of Transformer models.
However, prior work on pruning Transformers requires retraining the models. This can add …
Speculative decoding with big little decoder
The recent emergence of Large Language Models based on the Transformer architecture
has enabled dramatic advancements in the field of Natural Language Processing. However …
Full stack optimization of transformer inference: A survey
Recent advances in state-of-the-art DNN architecture design have been moving toward
Transformer models. These models achieve superior accuracy across a wide range of …
SpikingBERT: Distilling BERT to train spiking language models using implicit differentiation
Large Language Models (LLMs), though growing exceedingly powerful, comprise orders
of magnitude fewer neurons and synapses than the human brain. However, they require …
Neural architecture search for transformers: A survey
Transformer-based Deep Neural Network architectures have gained tremendous interest
due to their effectiveness in various applications across Natural Language Processing (NLP) …
NAS-Bench-NLP: Neural architecture search benchmark for natural language processing
Neural Architecture Search (NAS) is a promising and rapidly evolving research area.
Training a large number of neural networks requires an exceptional amount of …