Machine learning on big data: Opportunities and challenges

L Zhou, S Pan, J Wang, AV Vasilakos - Neurocomputing, 2017 - Elsevier
Abstract Machine learning (ML) is continuously unleashing its power in a wide range of
applications. It has been pushed to the forefront in recent years partly owing to the advent of …

A survey of FPGA-based accelerators for convolutional neural networks

S Mittal - Neural computing and applications, 2020 - Springer
Deep convolutional neural networks (CNNs) have recently shown very high accuracy in a
wide range of cognitive tasks, and due to this, they have received significant interest from the …

PUMA: A programmable ultra-efficient memristor-based accelerator for machine learning inference

A Ankit, IE Hajj, SR Chalamalasetti, G Ndu… - Proceedings of the …, 2019 - dl.acm.org
Memristor crossbars are circuits capable of performing analog matrix-vector multiplications,
overcoming the fundamental energy efficiency limitations of digital logic. They have been …

Pipelayer: A pipelined reram-based accelerator for deep learning

L Song, X Qian, H Li, Y Chen - 2017 IEEE international …, 2017 - ieeexplore.ieee.org
Convolution neural networks (CNNs) are the heart of deep learning applications. Recent
works PRIME [1] and ISAAC [2] demonstrated the promise of using resistive random access …

The architectural implications of autonomous driving: Constraints and acceleration

SC Lin, Y Zhang, CH Hsu, M Skach… - Proceedings of the …, 2018 - dl.acm.org
Autonomous driving systems have attracted a significant amount of interest recently, and
many industry leaders, such as Google, Uber, Tesla, and Mobileye, have invested a large …

From high-level deep neural models to FPGAs

H Sharma, J Park, D Mahajan, E Amaro… - 2016 49th Annual …, 2016 - ieeexplore.ieee.org
Deep Neural Networks (DNNs) are compute-intensive learning models with growing
applicability in a wide range of domains. FPGAs are an attractive choice for DNNs since they …

Rethinking software runtimes for disaggregated memory

I Calciu, MT Imran, I Puddu, S Kashyap… - Proceedings of the 26th …, 2021 - dl.acm.org
Disaggregated memory can address resource provisioning inefficiencies in current
datacenters. Multiple software runtimes for disaggregated memory have been proposed in …

FP-DNN: An automated framework for map** deep neural networks onto FPGAs with RTL-HLS hybrid templates

Y Guan, H Liang, N Xu, W Wang, S Shi… - 2017 IEEE 25th …, 2017 - ieeexplore.ieee.org
DNNs (Deep Neural Networks) have demonstrated great success in numerous applications
such as image classification, speech recognition, video analysis, etc. However, DNNs are …

Graphicionado: A high-performance and energy-efficient accelerator for graph analytics

TJ Ham, L Wu, N Sundaram, N Satish… - 2016 49th annual …, 2016 - ieeexplore.ieee.org
Graphs are one of the key data structures for many real-world computing applications and
the importance of graph analytics is ever-growing. While existing software graph processing …

AutoSA: A polyhedral compiler for high-performance systolic arrays on FPGA

J Wang, L Guo, J Cong - The 2021 ACM/SIGDA International Symposium …, 2021 - dl.acm.org
While systolic array architectures have the potential to deliver tremendous performance, it is
notoriously challenging to customize an efficient systolic array processor for a target …