LegUp: An open-source high-level synthesis tool for FPGA-based processor/accelerator systems

A Canis, J Choi, M Aldham, V Zhang… - ACM Transactions on …, 2013 - dl.acm.org
It is generally accepted that a custom hardware implementation of a set of computations will
provide superior speed and energy efficiency relative to a software implementation …

PTask: operating system abstractions to manage GPUs as compute devices

CJ Rossbach, J Currey, M Silberstein, B Ray… - Proceedings of the …, 2011 - dl.acm.org
We propose a new set of OS abstractions to support GPUs and other accelerator devices as
first class computing resources. These new abstractions, collectively called the PTask API …

Floem: A programming system for {NIC-Accelerated} network applications

PM Phothilimthana, M Liu, A Kaufmann… - … USENIX Symposium on …, 2018 - usenix.org
Develo** server applications that offload computation and data to a NIC accelerator is
laborious because one has to explore the design space of decisions about data placement …

Dandelion: a compiler and runtime for heterogeneous systems

CJ Rossbach, Y Yu, J Currey, JP Martin… - Proceedings of the …, 2013 - dl.acm.org
Computer systems increasingly rely on heterogeneity to achieve greater performance,
scalability and energy efficiency. Because heterogeneous systems typically comprise …

Sorting networks on FPGAs

R Mueller, J Teubner, G Alonso - The VLDB Journal, 2012 - Springer
Computer architectures are quickly changing toward heterogeneous many-core systems.
Such a trend opens up interesting opportunities but also raises immense challenges since …

Lime: a java-compatible and synthesizable language for heterogeneous architectures

J Auerbach, DF Bacon, P Cheng… - Proceedings of the ACM …, 2010 - dl.acm.org
The halt in clock frequency scaling has forced architects and language designers to look
elsewhere for continued improvements in performance. We believe that extracting maximum …

FCUDA: Enabling efficient compilation of CUDA kernels onto FPGAs

A Papakonstantinou, K Gururaj… - 2009 IEEE 7th …, 2009 - ieeexplore.ieee.org
As growing power dissipation and thermal effects disrupted the rising clock frequency trend
and threatened to annul Moore's law, the computing industry has switched its route to higher …

Data processing on FPGAs

R Mueller, J Teubner, G Alonso - Proceedings of the VLDB Endowment, 2009 - dl.acm.org
Computer architectures are quickly changing toward heterogeneous many-core systems.
Such a trend opens up interesting opportunities but also raises immense challenges since …

Leap scratchpads: automatic memory and cache management for reconfigurable logic

M Adler, KE Fleming, A Parashar, M Pellauer… - Proceedings of the 19th …, 2011 - dl.acm.org
Developers accelerating applications on FPGAs or other reconfigurable logic have nothing
but raw memory devices in their standard toolkits. Each project typically includes tedious …

High‐Level Synthesis: Productivity, Performance, and Software Constraints

Y Liang, K Rupnow, Y Li, D Min… - Journal of Electrical …, 2012 - Wiley Online Library
FPGAs are an attractive platform for applications with high computation demand and low
energy consumption requirements. However, design effort for FPGA implementations …