Accelerating CNN inference on ASICs: A survey

D Moolchandani, A Kumar, SR Sarangi - Journal of Systems Architecture, 2021 - Elsevier
Convolutional neural networks (CNNs) have proven to be a disruptive technology in most
vision, speech and image processing tasks. Given their ubiquitous acceptance, the research …

Graphics processing units in bioinformatics, computational biology and systems biology

MS Nobile, P Cazzaniga, A Tangherloni… - Briefings in …, 2017 - academic.oup.com
Abstract Several studies in Bioinformatics, Computational Biology and Systems Biology rely
on the definition of physico-chemical or mathematical models of biological systems at …

Comparing energy efficiency of CPU, GPU and FPGA implementations for vision kernels

M Qasaimeh, K Denolf, J Lo, K Vissers… - … and systems (ICESS …, 2019 - ieeexplore.ieee.org
Develo** high performance embedded vision applications requires balancing run-time
performance with energy constraints. Given the mix of hardware accelerators that exist for …

Rodinia: A benchmark suite for heterogeneous computing

S Che, M Boyer, J Meng, D Tarjan… - 2009 IEEE …, 2009 - ieeexplore.ieee.org
This paper presents and characterizes Rodinia, a benchmark suite for heterogeneous
computing. To help architects study emerging platforms such as GPUs (Graphics Processing …

Evaluating and optimizing OpenCL kernels for high performance computing with FPGAs

HR Zohouri, N Maruyama, A Smith… - SC'16: Proceedings …, 2016 - ieeexplore.ieee.org
We evaluate the power and performance of the Rodinia benchmark suite using the Altera
SDK for OpenCL targeting a Stratix V FPGA against a modern CPU and GPU. We study …

A characterization of the Rodinia benchmark suite with comparison to contemporary CMP workloads

S Che, JW Sheaffer, M Boyer… - IEEE International …, 2010 - ieeexplore.ieee.org
The recently released Rodinia benchmark suite enables users to evaluate heterogeneous
systems including both accelerators, such as GPUs, and multicore CPUs. As Rodinia sees …

A performance and energy comparison of FPGAs, GPUs, and multicores for sliding-window applications

J Fowers, G Brown, P Cooke, G Stitt - Proceedings of the ACM/SIGDA …, 2012 - dl.acm.org
With the emergence of accelerator devices such as multicores, graphics-processing units
(GPUs), and field-programmable gate arrays (FPGAs), application designers are confronted …

Understanding performance differences of FPGAs and GPUs

J Cong, Z Fang, M Lo, H Wang, J Xu… - 2018 IEEE 26th Annual …, 2018 - ieeexplore.ieee.org
This paper aims to better understand the performance differences between FPGAs and
GPUs. We intentionally begin with a widely used GPU-friendly benchmark suite, Rodinia …

State-of-the-art in heterogeneous computing

AR Brodtkorb, C Dyken, TR Hagen… - Scientific …, 2010 - content.iospress.com
Node level heterogeneous architectures have become attractive during the last decade for
several reasons: compared to traditional symmetric CPUs, they offer high peak performance …

Single-chip heterogeneous computing: Does the future include custom logic, FPGAs, and GPGPUs?

ES Chung, PA Milder, JC Hoe… - 2010 43rd annual IEEE …, 2010 - ieeexplore.ieee.org
To extend the exponential performance scaling of future chip multiprocessors, improving
energy efficiency has become a first-class priority. Single-chip heterogeneous computing …