FPGA-based accelerators of deep learning networks for learning and classification: A review
Due to recent advances in digital technologies, and availability of credible data, an area of
artificial intelligence, deep learning, has emerged and has demonstrated its ability and …
artificial intelligence, deep learning, has emerged and has demonstrated its ability and …
Evaluating fast algorithms for convolutional neural networks on FPGAs
In recent years, Convolutional Neural Networks (CNNs) have become widely adopted for
computer vision tasks. FPGAs have been adequately explored as a promising hardware …
computer vision tasks. FPGAs have been adequately explored as a promising hardware …
Exploring heterogeneous algorithms for accelerating deep convolutional neural networks on FPGAs
Convolutional neural network (CNN) finds applications in a variety of computer vision
applications ranging from object recognition and detection to scene understanding owing to …
applications ranging from object recognition and detection to scene understanding owing to …
CHARM: C omposing H eterogeneous A ccele R ators for M atrix Multiply on Versal ACAP Architecture
Dense matrix multiply (MM) serves as one of the most heavily used kernels in deep learning
applications. To cope with the high computation demands of these applications …
applications. To cope with the high computation demands of these applications …
Evaluating fast algorithms for convolutional neural networks on FPGAs
In recent years, convolutional neural networks (CNNs) have become widely adopted for
computer vision tasks. Field-programmable gate arrays (FPGAs) have been adequately …
computer vision tasks. Field-programmable gate arrays (FPGAs) have been adequately …
REQ-YOLO: A resource-aware, efficient quantization framework for object detection on FPGAs
Deep neural networks (DNNs), as the basis of object detection, will play a key role in the
development of future autonomous systems with full autonomy. The autonomous systems …
development of future autonomous systems with full autonomy. The autonomous systems …
Rosetta: A realistic high-level synthesis benchmark suite for software programmable FPGAs
Modern high-level synthesis (HLS) tools greatly reduce the turn-around time of designing
and implementing complex FPGA-based accelerators. They also expose various …
and implementing complex FPGA-based accelerators. They also expose various …
SpWA: An efficient sparse winograd convolutional neural networks accelerator on FPGAs
FPGAs have been an efficient accelerator for CNN inference due to its high performance,
flexibility, and energy-efficiency. To improve the performance of CNNs on FPGAs, fast …
flexibility, and energy-efficiency. To improve the performance of CNNs on FPGAs, fast …
Predictable accelerator design with time-sensitive affine types
Field-programmable gate arrays (FPGAs) provide an opportunity to co-design applications
with hardware accelerators, yet they remain difficult to program. High-level synthesis (HLS) …
with hardware accelerators, yet they remain difficult to program. High-level synthesis (HLS) …
Decoding small surface codes with feedforward neural networks
Surface codes reach high error thresholds when decoded with known algorithms, but the
decoding time will likely exceed the available time budget, especially for near-term …
decoding time will likely exceed the available time budget, especially for near-term …