Accelerating CNN inference on ASICs: A survey
Convolutional neural networks (CNNs) have proven to be a disruptive technology in most
vision, speech and image processing tasks. Given their ubiquitous acceptance, the research …
vision, speech and image processing tasks. Given their ubiquitous acceptance, the research …
Graphics processing units in bioinformatics, computational biology and systems biology
Abstract Several studies in Bioinformatics, Computational Biology and Systems Biology rely
on the definition of physico-chemical or mathematical models of biological systems at …
on the definition of physico-chemical or mathematical models of biological systems at …
Comparing energy efficiency of CPU, GPU and FPGA implementations for vision kernels
Develo** high performance embedded vision applications requires balancing run-time
performance with energy constraints. Given the mix of hardware accelerators that exist for …
performance with energy constraints. Given the mix of hardware accelerators that exist for …
Rodinia: A benchmark suite for heterogeneous computing
This paper presents and characterizes Rodinia, a benchmark suite for heterogeneous
computing. To help architects study emerging platforms such as GPUs (Graphics Processing …
computing. To help architects study emerging platforms such as GPUs (Graphics Processing …
Evaluating and optimizing OpenCL kernels for high performance computing with FPGAs
We evaluate the power and performance of the Rodinia benchmark suite using the Altera
SDK for OpenCL targeting a Stratix V FPGA against a modern CPU and GPU. We study …
SDK for OpenCL targeting a Stratix V FPGA against a modern CPU and GPU. We study …
A characterization of the Rodinia benchmark suite with comparison to contemporary CMP workloads
The recently released Rodinia benchmark suite enables users to evaluate heterogeneous
systems including both accelerators, such as GPUs, and multicore CPUs. As Rodinia sees …
systems including both accelerators, such as GPUs, and multicore CPUs. As Rodinia sees …
A performance and energy comparison of FPGAs, GPUs, and multicores for sliding-window applications
With the emergence of accelerator devices such as multicores, graphics-processing units
(GPUs), and field-programmable gate arrays (FPGAs), application designers are confronted …
(GPUs), and field-programmable gate arrays (FPGAs), application designers are confronted …
Understanding performance differences of FPGAs and GPUs
This paper aims to better understand the performance differences between FPGAs and
GPUs. We intentionally begin with a widely used GPU-friendly benchmark suite, Rodinia …
GPUs. We intentionally begin with a widely used GPU-friendly benchmark suite, Rodinia …
State-of-the-art in heterogeneous computing
Node level heterogeneous architectures have become attractive during the last decade for
several reasons: compared to traditional symmetric CPUs, they offer high peak performance …
several reasons: compared to traditional symmetric CPUs, they offer high peak performance …
Single-chip heterogeneous computing: Does the future include custom logic, FPGAs, and GPGPUs?
To extend the exponential performance scaling of future chip multiprocessors, improving
energy efficiency has become a first-class priority. Single-chip heterogeneous computing …
energy efficiency has become a first-class priority. Single-chip heterogeneous computing …