LegUp: An open-source high-level synthesis tool for FPGA-based processor/accelerator systems
It is generally accepted that a custom hardware implementation of a set of computations will
provide superior speed and energy efficiency relative to a software implementation …
PTask: operating system abstractions to manage GPUs as compute devices
We propose a new set of OS abstractions to support GPUs and other accelerator devices as
first class computing resources. These new abstractions, collectively called the PTask API …
Floem: A programming system for NIC-Accelerated network applications
Developing server applications that offload computation and data to a NIC accelerator is
laborious because one has to explore the design space of decisions about data placement …
Dandelion: a compiler and runtime for heterogeneous systems
Computer systems increasingly rely on heterogeneity to achieve greater performance,
scalability and energy efficiency. Because heterogeneous systems typically comprise …
Sorting networks on FPGAs
Computer architectures are quickly changing toward heterogeneous many-core systems.
Such a trend opens up interesting opportunities but also raises immense challenges since …
Lime: a Java-compatible and synthesizable language for heterogeneous architectures
J Auerbach, DF Bacon, P Cheng… - Proceedings of the ACM …, 2010 - dl.acm.org
The halt in clock frequency scaling has forced architects and language designers to look
elsewhere for continued improvements in performance. We believe that extracting maximum …
FCUDA: Enabling efficient compilation of CUDA kernels onto FPGAs
A Papakonstantinou, K Gururaj… - 2009 IEEE 7th …, 2009 - ieeexplore.ieee.org
As growing power dissipation and thermal effects disrupted the rising clock frequency trend
and threatened to annul Moore's law, the computing industry has switched its route to higher …
Data processing on FPGAs
Computer architectures are quickly changing toward heterogeneous many-core systems.
Such a trend opens up interesting opportunities but also raises immense challenges since …
Leap scratchpads: automatic memory and cache management for reconfigurable logic
Developers accelerating applications on FPGAs or other reconfigurable logic have nothing
but raw memory devices in their standard toolkits. Each project typically includes tedious …
High‐Level Synthesis: Productivity, Performance, and Software Constraints
FPGAs are an attractive platform for applications with high computation demand and low
energy consumption requirements. However, design effort for FPGA implementations …