FPGA-based accelerators of deep learning networks for learning and classification: A review
Due to recent advances in digital technologies, and availability of credible data, an area of
artificial intelligence, deep learning, has emerged and has demonstrated its ability and …
artificial intelligence, deep learning, has emerged and has demonstrated its ability and …
Efficient hardware architectures for accelerating deep neural networks: Survey
In the modern-day era of technology, a paradigm shift has been witnessed in the areas
involving applications of Artificial Intelligence (AI), Machine Learning (ML), and Deep …
involving applications of Artificial Intelligence (AI), Machine Learning (ML), and Deep …
AxBench: A multiplatform benchmark suite for approximate computing
Approximate computing is claimed to be a powerful knob for alleviating the peak power and
energy-efficiency issues. However, providing a consistent benchmark suit with diverse …
energy-efficiency issues. However, providing a consistent benchmark suit with diverse …
Fathom: Reference workloads for modern deep learning methods
Deep learning has been popularized by its recent successes on challenging artificial
intelligence problems. One of the reasons for its dominance is also an ongoing challenge …
intelligence problems. One of the reasons for its dominance is also an ongoing challenge …
A survey of machine learning for computer architecture and systems
It has been a long time that computer architecture and systems are optimized for efficient
execution of machine learning (ML) models. Now, it is time to reconsider the relationship …
execution of machine learning (ML) models. Now, it is time to reconsider the relationship …
DeepMaker: A multi-objective optimization framework for deep neural networks in embedded systems
Abstract Deep Neural Networks (DNNs) are compute-intensive learning models with
growing applicability in a wide range of domains. Due to their computational complexity …
growing applicability in a wide range of domains. Due to their computational complexity …
Ganax: A unified mimd-simd acceleration for generative adversarial networks
Generative Adversarial Networks (GANs) are one of the most recent deep learning models
that generate synthetic data from limited genuine datasets. GANs are on the frontier as …
that generate synthetic data from limited genuine datasets. GANs are on the frontier as …
Approximate computing survey, Part II: Application-specific & architectural approximation techniques and applications
The challenging deployment of compute-intensive applications from domains such as
Artificial Intelligence (AI) and Digital Signal Processing (DSP), forces the community of …
Artificial Intelligence (AI) and Digital Signal Processing (DSP), forces the community of …
Accelerating attention through gradient-based learned runtime pruning
Self-attention is a key enabler of state-of-art accuracy for various transformer-based Natural
Language Processing models. This attention mechanism calculates a correlation score for …
Language Processing models. This attention mechanism calculates a correlation score for …
APPROX-NoC: A data approximation framework for network-on-chip architectures
The trend of unsustainable power consumption and large memory bandwidth demands in
massively parallel multicore systems, with the advent of the big data era, has brought upon …
massively parallel multicore systems, with the advent of the big data era, has brought upon …