A survey on hardware accelerator design of deep learning for edge devices

A Samanta, I Hatai, AK Mal - Wireless Personal Communications, 2024 - Springer
In artificial intelligence, the large role is played by machine learning (ML) in a variety of
applications. This article aims at providing a comprehensive survey on summarizing recent …

[HTML][HTML] A High-Performance and Ultra-Low-Power Accelerator Design for Advanced Deep Learning Algorithms on an FPGA

A Gundrapally, YA Shah, N Alnatsheh, KK Choi - Electronics, 2024 - mdpi.com
This article addresses the growing need in resource-constrained edge computing scenarios
for energy-efficient convolutional neural network (CNN) accelerators on mobile Field …

A High-Performance Pixel-Level Fully Pipelined Hardware Accelerator for Neural Networks

Z Li, Z Zhang, J Hu, Q Meng, X Shi… - … on Neural Networks …, 2024 - ieeexplore.ieee.org
The design of convolutional neural network (CNN) hardware accelerators based on a single
computing engine (CE) architecture or multi-CE architecture has received widespread …

Algorithm–Hardware Co-Optimization and Deployment Method for Field-Programmable Gate-Array-Based Convolutional Neural Network Remote Sensing Image …

S Ni, X Wei, N Zhang, H Chen - Remote Sensing, 2023 - mdpi.com
In recent years, convolutional neural networks (CNNs) have gained widespread adoption in
remote sensing image processing. Deploying CNN-based algorithms on satellite edge …

Design Space Exploration for Edge Machine Learning featured by MathWorks FPGA DL Processor: A Survey

S Bertazzoni, L Canese, GC Cardarilli… - IEEE …, 2024 - ieeexplore.ieee.org
This paper proposes a Design Space Exploration for Edge machine learning through the
utilization of the novel MathWorks FPGA Deep Learning Processor IP, featured in the HDL …

Synchronizing Object Detection: Applications, Advancements and Existing Challenges

MT Hosain, A Zaman, MR Abir, S Akter… - IEEE …, 2024 - ieeexplore.ieee.org
From pivotal roles in autonomous vehicles, healthcare diagnostics, and surveillance
systems to seamlessly integrating with augmented reality, object detection algorithms stand …

Design and implementation of FPGA based system for object detection and range estimation used in ADAS applications utilizing FMCW radar

M Khan, P Mahajan, GN Khan… - … on Circuits and …, 2024 - ieeexplore.ieee.org
This paper presents the design and implementation of a hardware system for real-time
object detection and range estimation utilizing Frequency-Modulated Continuous Wave …

ADS-CNN: Adaptive Dataflow Scheduling for lightweight CNN accelerator on FPGAs

Y Wan, X **e, J Chen, K **e, D Yi, Y Lu, K Gai - Future Generation …, 2024 - Elsevier
Lightweight convolutional neural networks (CNNs) enable lower inference latency and data
traffic, facilitating deployment on resource-constrained edge devices such as field …

HePiLUT: Resource Efficient Heterogeneous Pipelined CNN Accelerator for FPGAs

R Al Amin, MSA Hossain… - … on Intelligent Computing …, 2024 - ieeexplore.ieee.org
Field programmable gate arrays (FPGAs) have gained recognition as a suitable platform for
implementing Convolutional Neural Network (CNN)-based algorithms due to their favorable …

PEFSL: A deployment Pipeline for Embedded Few-Shot Learning on a FPGA SoC

L Grativol, L Gauthier, M Leonardon… - … on Circuits and …, 2024 - ieeexplore.ieee.org
This paper tackles the challenges of implementing few-shot learning on embedded systems,
specifically FPGA SoCs, a vital approach for adapting to diverse classification tasks …