- Academic Search

A survey on efficient convolutional neural networks and hardware acceleration

D Ghimire, D Kil, S Kim - Electronics, 2022 - mdpi.com

Over the past decade, deep-learning-based representations have demonstrated remarkable
performance in academia and industry. The learning capability of convolutional neural …

Cited by 187. All versions: 5.

Model compression and hardware acceleration for neural networks: A comprehensive survey

L Deng, G Li, S Han, L Shi, Y Xie - Proceedings of the IEEE, 2020 - ieeexplore.ieee.org

Domain-specific hardware is becoming a promising topic in the backdrop of improvement
slow down for general-purpose processors due to the foreseeable end of Moore's Law …

Cited by 987. All versions: 2.

[PDF] jmlr.org

Sparsity in deep learning: Pruning and growth for efficient inference and training in neural networks

T Hoefler, D Alistarh, T Ben-Nun, N Dryden… - Journal of Machine …, 2021 - jmlr.org

The growing energy and performance costs of deep learning have driven the community to
reduce the size of neural networks by selectively pruning components. Similarly to their …

Cited by 878. All versions: 27.

[PDF] arxiv.org

SpAtten: Efficient sparse attention architecture with cascade token and head pruning

H Wang, Z Zhang, S Han - 2021 IEEE International Symposium …, 2021 - ieeexplore.ieee.org

The attention mechanism is becoming increasingly popular in Natural Language Processing
(NLP) applications, showing superior performance than convolutional and recurrent …

Cited by 397. All versions: 6.

[PDF] ieee.org

Efficient acceleration of deep learning inference on resource-constrained edge devices: A review

MMH Shuvo, SK Islam, J Cheng… - Proceedings of the …, 2022 - ieeexplore.ieee.org

Successful integration of deep neural networks (DNNs) or deep learning (DL) has resulted
in breakthroughs in many areas. However, deploying these highly accurate models for data …

Cited by 147. All versions: 5.

[PDF] gatech.edu

SIGMA: A sparse and irregular GEMM accelerator with flexible interconnects for DNN training

E Qin, A Samajdar, H Kwon, V Nadella… - … Symposium on High …, 2020 - ieeexplore.ieee.org

The advent of Deep Learning (DL) has radically transformed the computing industry across
the entire spectrum from algorithms to circuits. As myriad application domains embrace DL, it …

Cited by 494. All versions: 7.

[PDF] arxiv.org

Eyeriss v2: A flexible accelerator for emerging deep neural networks on mobile devices

YH Chen, TJ Yang, J Emer… - IEEE Journal on Emerging …, 2019 - ieeexplore.ieee.org

A recent trend in deep neural network (DNN) development is to extend the reach of deep
learning applications to platforms that are more resource and energy-constrained, e.g. …

Cited by 1113. All versions: 6.

[PDF] thecvf.com

AMC: AutoML for model compression and acceleration on mobile devices

Y He, J Lin, Z Liu, H Wang, LJ Li… - Proceedings of the …, 2018 - openaccess.thecvf.com

Model compression is an effective technique to efficiently deploy neural network
models on mobile devices which have limited computation resources and tight power …

Cited by 1735. All versions: 13.

[PDF] umich.edu

Machine learning at Facebook: Understanding inference at the edge

CJ Wu, D Brooks, K Chen, D Chen… - … symposium on high …, 2019 - ieeexplore.ieee.org

At Facebook, machine learning provides a wide range of capabilities that drive many
aspects of user experience including ranking posts, content understanding, object detection …

Cited by 583. All versions: 6.

[PDF] arxiv.org

Machine learning at the network edge: A survey

MGS Murshed, C Murphy, D Hou, N Khan… - ACM Computing …, 2021 - dl.acm.org

Resource-constrained IoT devices, such as sensors and actuators, have become ubiquitous
in recent years. This has led to the generation of large quantities of data in real-time, which …

Cited by 490. All versions: 6.

SCNN: An accelerator for compressed-sparse convolutional neural networks

A survey on efficient convolutional neural networks and hardware acceleration
