Efficient acceleration of deep learning inference on resource-constrained edge devices: A review

MMH Shuvo, SK Islam, J Cheng… - Proceedings of the …, 2022 - ieeexplore.ieee.org
Successful integration of deep neural networks (DNNs) or deep learning (DL) has resulted
in breakthroughs in many areas. However, deploying these highly accurate models for data …

Towards the use of artificial intelligence on the edge in space systems: Challenges and opportunities

G Furano, G Meoni, A Dunne… - IEEE Aerospace and …, 2020 - ieeexplore.ieee.org
The market for remote sensing space-based applications is fundamentally limited by up-and
downlink bandwidth and onboard compute capability for space data handling systems. This …

Survey of machine learning accelerators

A Reuther, P Michaleas, M Jones… - 2020 IEEE high …, 2020 - ieeexplore.ieee.org
New machine learning accelerators are being announced and released each month for a
variety of applications from speech recognition, video object detection, assisted driving, and …

Mixed precision algorithms in numerical linear algebra

NJ Higham, T Mary - Acta Numerica, 2022 - cambridge.org
Today's floating-point arithmetic landscape is broader than ever. While scientific computing
has traditionally used single precision and double precision floating-point arithmetics, half …

AI accelerator survey and trends

A Reuther, P Michaleas, M Jones… - 2021 IEEE High …, 2021 - ieeexplore.ieee.org
Over the past several years, new machine learning accelerators were being announced and
released every month for a variety of applications from speech recognition, video object …

Gobo: Quantizing attention-based nlp models for low latency and energy efficient inference

AH Zadeh, I Edo, OM Awad… - 2020 53rd Annual IEEE …, 2020 - ieeexplore.ieee.org
Attention-based models have demonstrated remarkable success in various natural
language understanding tasks. However, efficient execution remains a challenge for these …

Adaptive inference through early-exit networks: Design, challenges and directions

S Laskaridis, A Kouris, ND Lane - … of the 5th International Workshop on …, 2021 - dl.acm.org
DNNs are becoming less and less over-parametrised due to recent advances in efficient
model design, through careful hand-crafted or NAS-based methods. Relying on the fact that …

AI and ML accelerator survey and trends

A Reuther, P Michaleas, M Jones… - 2022 IEEE High …, 2022 - ieeexplore.ieee.org
This paper updates the survey of AI accelerators and processors from past three years. This
paper collects and summarizes the current commercial accelerators that have been publicly …

On-device deep learning for mobile and wearable sensing applications: A review

OD Incel, SÖ Bursa - IEEE Sensors Journal, 2023 - ieeexplore.ieee.org
Although running deep-learning (DL) algorithms is challenging due to resource constraints
on mobile and wearable devices, they provide performance improvements compared to …

Stochastic rounding: implementation, error analysis and applications

M Croci, M Fasi, NJ Higham… - Royal Society Open …, 2022 - royalsocietypublishing.org
Stochastic rounding (SR) randomly maps a real number x to one of the two nearest values in
a finite precision number system. The probability of choosing either of these two numbers is …