Can FPGAs beat GPUs in accelerating next-generation deep neural networks?

E Nurvitadhi, G Venkatesh, J Sim, D Marr… - Proceedings of the …, 2017 - dl.acm.org
Current-generation Deep Neural Networks (DNNs), such as AlexNet and VGG, rely heavily
on dense floating-point matrix multiplication (GEMM), which maps well to GPUs (regular …

A Bayesian hierarchical model for learning natural scene categories

L Fei-Fei, P Perona - … vision and pattern recognition (CVPR'05), 2005 - ieeexplore.ieee.org
We propose a novel approach to learn and recognize natural scene categories. Unlike
previous work, it does not require experts to annotate the training set. We represent the …

SPIRAL: Extreme performance portability

F Franchetti, TM Low, DT Popovici… - Proceedings of the …, 2018 - ieeexplore.ieee.org
In this paper, we address the question of how to automatically map computational kernels to
highly efficient code for a wide range of computing platforms and establish the correctness of …

GapFlyt: Active vision based minimalist structure-less gap detection for quadrotor flight

NJ Sanket, CD Singh, K Ganguly… - IEEE Robotics and …, 2018 - ieeexplore.ieee.org
Although quadrotors, and aerial robots in general, are inherently active agents, their
perceptual capabilities in literature so far have been mostly passive in nature. Researchers …

Fully integrated FPGA molecular dynamics simulations

C Yang, T Geng, T Wang, R Patel, Q Xiong… - Proceedings of the …, 2019 - dl.acm.org
The implementation of Molecular Dynamics (MD) on FPGAs has received substantial
attention. Previous work, however, has consisted of either proof-of-concept implementations …

Avoiding game over: Bringing design to the next level

O Shacham, M Wachs, A Danowitz, S Galal… - Proceedings of the 49th …, 2012 - dl.acm.org
Technology scaling has created a catch-22: technology now can do almost anything we
want, but the NRE design costs are so high, that almost no one can afford to use it. Our …

Floating-point mixed-radix FFT core generation for FPGA and comparison with GPU and CPU

B Duan, W Wang, X Li, C Zhang… - … Conference on Field …, 2011 - ieeexplore.ieee.org
Over the past decades, we noticed huge advances in FPGA technologies. The topic of
floating-point accelerator on FPGA has gained renewed interests due to the increased …

3D FFTs on a Single FPGA

B Humphries, H Zhang, J Sheng… - 2014 IEEE 22nd …, 2014 - ieeexplore.ieee.org
The 3D FFT is critical in many physical simulations and image processing applications. On
FPGAs, however, the 3D FFT was thought to be inefficient relative to other methods such as …

Real-time 20.37 Gb/s optical OFDM receiver for PON IM/DD systems

JS Bruno, V Almenar, J Valls, JL Corral - Optics Express, 2018 - opg.optica.org
This paper presents the hardware architecture of an OFDM receiver suitable for optical
communications. The receiver has been implemented in an FPGA device and used to …

Asynchronous event-based Fourier analysis

Q Sabatier, SH Ieng… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
This paper introduces a method to compute the FFT of a visual scene at a high temporal
precision of around 1-μs output from an asynchronous event-based camera. Event-based …