Co-designing accelerators and SoC interfaces using gem5-Aladdin
Increasing demand for power-efficient, high-performance computing has spurred a growing
number and diversity of hardware accelerators in mobile and server Systems on Chip …
number and diversity of hardware accelerators in mobile and server Systems on Chip …
Svelto: High-level synthesis of multi-threaded accelerators for graph analytics
Graph analytics are an emerging class of irregular applications. Operating on very large
datasets, they present unique behaviors, such as fine-grained, unpredictable memory …
datasets, they present unique behaviors, such as fine-grained, unpredictable memory …
SPARTA: High-Level Synthesis of Parallel Multi-Threaded Accelerators
This article presents a methodology for the Synthesis of PARallel multi-Threaded
Accelerators (SPARTA) from OpenMP annotated C/C++ specifications. SPARTA extends an …
Accelerators (SPARTA) from OpenMP annotated C/C++ specifications. SPARTA extends an …
Scaling performance for N-body stream computation with a ring of FPGAs
Field-Programmable Gate Arrays (FPGAs) offer a fairly non-invasive method to specialize
custom architectures towards a specific application domain. Recent studies has successfully …
custom architectures towards a specific application domain. Recent studies has successfully …
Design and modeling of specialized architectures
Y Shao - 2016 - dash.harvard.edu
Hardware acceleration in the form of customized datapath and control circuitry tuned to
specific applications has gained popularity for its promise to utilize transistors more …
specific applications has gained popularity for its promise to utilize transistors more …
Synthesis of interleaved multithreaded accelerators from OpenMP loops
Similarly to CPUs and GPUs, FPGA-based accelerators can also profit from exploiting thread-
level parallelism. Thus, the synthesis tools for generating the circuits from high-level …
level parallelism. Thus, the synthesis tools for generating the circuits from high-level …
Design of OpenCL-compatible multithreaded hardware accelerators with dynamic support for embedded FPGAs
A Rodrıguez, J Valverde… - … Computing and FPGAs …, 2015 - ieeexplore.ieee.org
ARTICo 3 is an architecture that permits to dynamically set an arbitrary number of
reconfigurable hardware accelerators, each containing a given number of threads fixed at …
reconfigurable hardware accelerators, each containing a given number of threads fixed at …
Multithread accelerators on fpgas: A dataflow-based approach
Multithreading is a well-known technique for general-purpose systems to deliver a
substantial performance gain, raising resource efficiency by exploiting underutilization …
substantial performance gain, raising resource efficiency by exploiting underutilization …
Optimized high-level synthesis of SMT multi-threaded hardware accelerators
Recent high-level synthesis tools offer the capability to generate multi-threaded micro-
architectures to hide memory access latencies. In many HLS flows, this is often achieved by …
architectures to hide memory access latencies. In many HLS flows, this is often achieved by …
Area-Efficient Memory Scheduling for Dynamically Scheduled High-Level Synthesis
X He, J Cheng… - … Conference on Field …, 2022 - ieeexplore.ieee.org
In high-level synthesis, scheduling maps operations into clock cycles. It can either be done
at compile time (statically) or run time (dynamically). There has been recent interests in …
at compile time (statically) or run time (dynamically). There has been recent interests in …