Efficient acceleration of deep learning inference on resource-constrained edge devices: A review
Successful integration of deep neural networks (DNNs) or deep learning (DL) has resulted
in breakthroughs in many areas. However, deploying these highly accurate models for data …
in breakthroughs in many areas. However, deploying these highly accurate models for data …
Towards the use of artificial intelligence on the edge in space systems: Challenges and opportunities
The market for remote sensing space-based applications is fundamentally limited by up-and
downlink bandwidth and onboard compute capability for space data handling systems. This …
downlink bandwidth and onboard compute capability for space data handling systems. This …
Survey of machine learning accelerators
New machine learning accelerators are being announced and released each month for a
variety of applications from speech recognition, video object detection, assisted driving, and …
variety of applications from speech recognition, video object detection, assisted driving, and …
Mixed precision algorithms in numerical linear algebra
Today's floating-point arithmetic landscape is broader than ever. While scientific computing
has traditionally used single precision and double precision floating-point arithmetics, half …
has traditionally used single precision and double precision floating-point arithmetics, half …
AI accelerator survey and trends
Over the past several years, new machine learning accelerators were being announced and
released every month for a variety of applications from speech recognition, video object …
released every month for a variety of applications from speech recognition, video object …
Gobo: Quantizing attention-based nlp models for low latency and energy efficient inference
Attention-based models have demonstrated remarkable success in various natural
language understanding tasks. However, efficient execution remains a challenge for these …
language understanding tasks. However, efficient execution remains a challenge for these …
Adaptive inference through early-exit networks: Design, challenges and directions
DNNs are becoming less and less over-parametrised due to recent advances in efficient
model design, through careful hand-crafted or NAS-based methods. Relying on the fact that …
model design, through careful hand-crafted or NAS-based methods. Relying on the fact that …
AI and ML accelerator survey and trends
This paper updates the survey of AI accelerators and processors from past three years. This
paper collects and summarizes the current commercial accelerators that have been publicly …
paper collects and summarizes the current commercial accelerators that have been publicly …
On-device deep learning for mobile and wearable sensing applications: A review
OD Incel, SÖ Bursa - IEEE Sensors Journal, 2023 - ieeexplore.ieee.org
Although running deep-learning (DL) algorithms is challenging due to resource constraints
on mobile and wearable devices, they provide performance improvements compared to …
on mobile and wearable devices, they provide performance improvements compared to …
Stochastic rounding: implementation, error analysis and applications
Stochastic rounding (SR) randomly maps a real number x to one of the two nearest values in
a finite precision number system. The probability of choosing either of these two numbers is …
a finite precision number system. The probability of choosing either of these two numbers is …