FPGA-based accelerators of deep learning networks for learning and classification: A review

A Shawahna, SM Sait, A El-Maleh - ieee Access, 2018 - ieeexplore.ieee.org
Due to recent advances in digital technologies, and availability of credible data, an area of
artificial intelligence, deep learning, has emerged and has demonstrated its ability and …

Efficient hardware architectures for accelerating deep neural networks: Survey

P Dhilleswararao, S Boppu, MS Manikandan… - IEEE …, 2022 - ieeexplore.ieee.org
In the modern-day era of technology, a paradigm shift has been witnessed in the areas
involving applications of Artificial Intelligence (AI), Machine Learning (ML), and Deep …

AxBench: A multiplatform benchmark suite for approximate computing

A Yazdanbakhsh, D Mahajan… - IEEE Design & …, 2016 - ieeexplore.ieee.org
Approximate computing is claimed to be a powerful knob for alleviating the peak power and
energy-efficiency issues. However, providing a consistent benchmark suit with diverse …

Fathom: Reference workloads for modern deep learning methods

R Adolf, S Rama, B Reagen, GY Wei… - 2016 IEEE …, 2016 - ieeexplore.ieee.org
Deep learning has been popularized by its recent successes on challenging artificial
intelligence problems. One of the reasons for its dominance is also an ongoing challenge …

A survey of machine learning for computer architecture and systems

N Wu, Y **e - ACM Computing Surveys (CSUR), 2022 - dl.acm.org
It has been a long time that computer architecture and systems are optimized for efficient
execution of machine learning (ML) models. Now, it is time to reconsider the relationship …

DeepMaker: A multi-objective optimization framework for deep neural networks in embedded systems

M Loni, S Sinaei, A Zoljodi, M Daneshtalab… - Microprocessors and …, 2020 - Elsevier
Abstract Deep Neural Networks (DNNs) are compute-intensive learning models with
growing applicability in a wide range of domains. Due to their computational complexity …

Ganax: A unified mimd-simd acceleration for generative adversarial networks

A Yazdanbakhsh, H Falahati, PJ Wolfe… - 2018 ACM/IEEE 45th …, 2018 - ieeexplore.ieee.org
Generative Adversarial Networks (GANs) are one of the most recent deep learning models
that generate synthetic data from limited genuine datasets. GANs are on the frontier as …

Approximate computing survey, Part II: Application-specific & architectural approximation techniques and applications

V Leon, MA Hanif, G Armeniakos, X Jiao… - ACM Computing …, 2023 - dl.acm.org
The challenging deployment of compute-intensive applications from domains such as
Artificial Intelligence (AI) and Digital Signal Processing (DSP), forces the community of …

Accelerating attention through gradient-based learned runtime pruning

Z Li, S Ghodrati, A Yazdanbakhsh… - Proceedings of the 49th …, 2022 - dl.acm.org
Self-attention is a key enabler of state-of-art accuracy for various transformer-based Natural
Language Processing models. This attention mechanism calculates a correlation score for …

APPROX-NoC: A data approximation framework for network-on-chip architectures

R Boyapati, J Huang, P Majumder, KH Yum… - Proceedings of the 44th …, 2017 - dl.acm.org
The trend of unsustainable power consumption and large memory bandwidth demands in
massively parallel multicore systems, with the advent of the big data era, has brought upon …