Recent advances in convolutional neural network acceleration

Q Zhang, M Zhang, T Chen, Z Sun, Y Ma, B Yu - Neurocomputing, 2019 - Elsevier
In recent years, convolutional neural networks (CNNs) have shown great performance in
various fields such as image classification, pattern recognition, and multi-media …

A survey of FPGA-based accelerators for convolutional neural networks

S Mittal - Neural computing and applications, 2020 - Springer
Deep convolutional neural networks (CNNs) have recently shown very high accuracy in a
wide range of cognitive tasks, and due to this, they have received significant interest from the …

A high-throughput and power-efficient FPGA implementation of YOLO CNN for object detection

DT Nguyen, TN Nguyen, H Kim… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Convolutional neural networks (CNNs) require numerous computations and external
memory accesses. Frequent accesses to off-chip memory cause slow processing and large …

DNNBuilder: An automated tool for building high-performance DNN hardware accelerators for FPGAs

X Zhang, J Wang, C Zhu, Y Lin, J **ong… - 2018 IEEE/ACM …, 2018 - ieeexplore.ieee.org
Building a high-performance FPGA accelerator for Deep Neural Networks (DNNs) often
requires RTL programming, hardware verification, and precise resource allocation, all of …

A survey of FPGA-based neural network accelerator

K Guo, S Zeng, J Yu, Y Wang, H Yang - arxiv preprint arxiv:1712.08934, 2017 - arxiv.org
Recent researches on neural network have shown significant advantage in machine
learning over traditional algorithms based on handcrafted features and models. Neural …

A CNN accelerator on FPGA using depthwise separable convolution

L Bai, Y Zhao, X Huang - … on Circuits and Systems II: Express …, 2018 - ieeexplore.ieee.org
Convolutional neural networks (CNNs) have been widely deployed in the fields of computer
vision and pattern recognition because of their high accuracy. However, large convolution …

Flextensor: An automatic schedule exploration and optimization framework for tensor computation on heterogeneous system

S Zheng, Y Liang, S Wang, R Chen… - Proceedings of the Twenty …, 2020 - dl.acm.org
Tensor computation plays a paramount role in a broad range of domains, including machine
learning, data analytics, and scientific computing. The wide adoption of tensor computation …

Recent advances in efficient computation of deep convolutional neural networks

J Cheng, P Wang, G Li, Q Hu, H Lu - Frontiers of Information Technology & …, 2018 - Springer
Deep neural networks have evolved remarkably over the past few years and they are
currently the fundamental tools of many intelligent systems. At the same time, the …

Hw-nas-bench: Hardware-aware neural architecture search benchmark

C Li, Z Yu, Y Fu, Y Zhang, Y Zhao, H You, Q Yu… - arxiv preprint arxiv …, 2021 - arxiv.org
HardWare-aware Neural Architecture Search (HW-NAS) has recently gained tremendous
attention by automating the design of DNNs deployed in more resource-constrained daily …

A systematic literature review on hardware implementation of artificial intelligence algorithms

MA Talib, S Majzoub, Q Nasir, D Jamal - The Journal of Supercomputing, 2021 - Springer
Artificial intelligence (AI) and machine learning (ML) tools play a significant role in the recent
evolution of smart systems. AI solutions are pushing towards a significant shift in many fields …