Co-designing accelerators and SoC interfaces using gem5-Aladdin

YS Shao, SL **, V Srinivasan, GY Wei… - 2016 49th Annual …, 2016 - ieeexplore.ieee.org
Increasing demand for power-efficient, high-performance computing has spurred a growing
number and diversity of hardware accelerators in mobile and server Systems on Chip …

Svelto: High-level synthesis of multi-threaded accelerators for graph analytics

M Minutoli, VG Castellana, N Saporetti… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
Graph analytics are an emerging class of irregular applications. Operating on very large
datasets, they present unique behaviors, such as fine-grained, unpredictable memory …

SPARTA: High-Level Synthesis of Parallel Multi-Threaded Accelerators

G Gozzi, M Fiorito, S Curzel, C Barone… - ACM Transactions on …, 2024 - dl.acm.org
This article presents a methodology for the Synthesis of PARallel multi-Threaded
Accelerators (SPARTA) from OpenMP annotated C/C++ specifications. SPARTA extends an …

Scaling performance for N-body stream computation with a ring of FPGAs

J Huthmann, A Shin, A Podobas, K Sano… - Proceedings of the 10th …, 2019 - dl.acm.org
Field-Programmable Gate Arrays (FPGAs) offer a fairly non-invasive method to specialize
custom architectures towards a specific application domain. Recent studies has successfully …

Design and modeling of specialized architectures

Y Shao - 2016 - dash.harvard.edu
Hardware acceleration in the form of customized datapath and control circuitry tuned to
specific applications has gained popularity for its promise to utilize transistors more …

Synthesis of interleaved multithreaded accelerators from OpenMP loops

L Sommer, J Oppermann, J Hofmann… - … Computing and FPGAs …, 2017 - ieeexplore.ieee.org
Similarly to CPUs and GPUs, FPGA-based accelerators can also profit from exploiting thread-
level parallelism. Thus, the synthesis tools for generating the circuits from high-level …

Design of OpenCL-compatible multithreaded hardware accelerators with dynamic support for embedded FPGAs

A Rodrıguez, J Valverde… - … Computing and FPGAs …, 2015 - ieeexplore.ieee.org
ARTICo 3 is an architecture that permits to dynamically set an arbitrary number of
reconfigurable hardware accelerators, each containing a given number of threads fixed at …

Multithread accelerators on fpgas: A dataflow-based approach

F Ratto, S Esposito, C Sau, L Raffo… - 13th Workshop on …, 2022 - drops.dagstuhl.de
Multithreading is a well-known technique for general-purpose systems to deliver a
substantial performance gain, raising resource efficiency by exploiting underutilization …

Optimized high-level synthesis of SMT multi-threaded hardware accelerators

J Huthmann, A Koch - 2015 International Conference on Field …, 2015 - ieeexplore.ieee.org
Recent high-level synthesis tools offer the capability to generate multi-threaded micro-
architectures to hide memory access latencies. In many HLS flows, this is often achieved by …

Area-Efficient Memory Scheduling for Dynamically Scheduled High-Level Synthesis

X He, J Cheng… - … Conference on Field …, 2022 - ieeexplore.ieee.org
In high-level synthesis, scheduling maps operations into clock cycles. It can either be done
at compile time (statically) or run time (dynamically). There has been recent interests in …