Structured pruning for deep convolutional neural networks: A survey

Y He, L Xiao - IEEE Transactions on Pattern Analysis and …, 2023 - ieeexplore.ieee.org
The remarkable performance of deep convolutional neural networks (CNNs) is generally
attributed to their deeper and wider architectures, which can come with significant …

A comprehensive survey on model compression and acceleration

T Choudhary, V Mishra, A Goswami… - Artificial Intelligence …, 2020 - Springer
In recent years, machine learning (ML) and deep learning (DL) have shown remarkable
improvement in computer vision, natural language processing, stock prediction, forecasting …

FlashAttention: Fast and memory-efficient exact attention with IO-awareness

T Dao, D Fu, S Ermon, A Rudra… - Advances in Neural …, 2022 - proceedings.neurips.cc
Transformers are slow and memory-hungry on long sequences, since the time and memory
complexity of self-attention are quadratic in sequence length. Approximate attention …

On-device training under 256KB memory

J Lin, L Zhu, WM Chen, WC Wang… - Advances in Neural …, 2022 - proceedings.neurips.cc
On-device training enables the model to adapt to new data collected from the sensors by
fine-tuning a pre-trained model. Users can benefit from customized AI models without having …

A-ViT: Adaptive tokens for efficient vision transformer

H Yin, A Vahdat, JM Alvarez, A Mallya… - Proceedings of the …, 2022 - openaccess.thecvf.com
We introduce A-ViT, a method that adaptively adjusts the inference cost of vision transformer
(ViT) for images of different complexity. A-ViT achieves this by automatically reducing the …

Dynamic neural networks: A survey

Y Han, G Huang, S Song, L Yang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Dynamic neural networks are an emerging research topic in deep learning. Compared to static
models, which have fixed computational graphs and parameters at the inference stage …

Sparsity in deep learning: Pruning and growth for efficient inference and training in neural networks

T Hoefler, D Alistarh, T Ben-Nun, N Dryden… - Journal of Machine …, 2021 - jmlr.org
The growing energy and performance costs of deep learning have driven the community to
reduce the size of neural networks by selectively pruning components. Similarly to their …

Pruning and quantization for deep neural network acceleration: A survey

T Liang, J Glossner, L Wang, S Shi, X Zhang - Neurocomputing, 2021 - Elsevier
Deep neural networks have been applied in many applications exhibiting extraordinary
abilities in the field of computer vision. However, complex network architectures challenge …

AdaViT: Adaptive vision transformers for efficient image recognition

L Meng, H Li, BC Chen, S Lan, Z Wu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Built on top of self-attention mechanisms, vision transformers have demonstrated
remarkable performance on a variety of vision tasks recently. While achieving excellent …

MCUNet: Tiny deep learning on IoT devices

J Lin, WM Chen, Y Lin, C Gan… - Advances in neural …, 2020 - proceedings.neurips.cc
Machine learning on tiny IoT devices based on microcontroller units (MCUs) is
appealing but challenging: the memory of microcontrollers is 2-3 orders of magnitude …