Pushing the level of abstraction of digital system design: A survey on how to program fpgas

ED Sozzo, D Conficconi, A Zeni, M Salaris… - ACM Computing …, 2022 - dl.acm.org
Field Programmable Gate Arrays (FPGAs) are spatial architectures with a heterogeneous
reconfigurable fabric. They are state-of-the-art for prototy**, telecommunications …

Tiramisu: A polyhedral compiler for expressing fast and portable code

R Baghdadi, J Ray, MB Romdhane… - 2019 IEEE/ACM …, 2019 - ieeexplore.ieee.org
This paper introduces Tiramisu, a polyhedral framework designed to generate high
performance code for multiple platforms including multicores, GPUs, and distributed …

Stateful dataflow multigraphs: A data-centric model for performance portability on heterogeneous architectures

T Ben-Nun, J de Fine Licht, AN Ziogas… - Proceedings of the …, 2019 - dl.acm.org
The ubiquity of accelerators in high-performance computing has driven programming
complexity beyond the skill-set of the average domain scientist. To maintain performance …

An MLIR-based compiler flow for system-level design and hardware acceleration

NB Agostini, S Curzel, V Amatya, C Tan… - Proceedings of the 41st …, 2022 - dl.acm.org
The generation of custom hardware accelerators for applications implemented within high-
level productive programming frameworks requires considerable manual effort. To automate …

AnyHLS: High-level synthesis with partial evaluation

MA Özkan, A Pérard-Gayot, R Membarth… - … on Computer-Aided …, 2020 - ieeexplore.ieee.org
Field programmable gate arrays (FPGAs) excel in low power and high throughput
computations, but they are challenging to program. Traditionally, developers rely on …

Towards automatic high-level code deployment on reconfigurable platforms: A survey of high-level synthesis tools and toolchains

MW Numan, BJ Phillips, GS Puddy, K Falkner - IEEE Access, 2020 - ieeexplore.ieee.org
Heterogeneous computing systems with tightly coupled processors and reconfigurable logic
blocks provide great scope to improve software performance by executing each section of …

AXI4MLIR: User-Driven Automatic Host Code Generation for Custom AXI-Based Accelerators

NB Agostini, J Haris, P Gibson… - 2024 IEEE/ACM …, 2024 - ieeexplore.ieee.org
This paper addresses the need for automatic and efficient generation of host driver code for
arbitrary custom AXI-based accelerators targeting linear algebra algorithms, an important …

A computational stack for cross-domain acceleration

S Kinzer, JK Kim, S Ghodrati, B Yatham… - … Symposium on High …, 2021 - ieeexplore.ieee.org
Domain-specific accelerators obtain performance benefits by restricting their algorithmic
domain. These accelerators utilize specialized languages constrained to particular …

Popa: Expressing high and portable performance across spatial and vector architectures for tensor computations

X Hao, H Rong, M Zhang, C Sun, H Jiang… - Proceedings of the 2024 …, 2024 - dl.acm.org
This paper aims at high and portable performance for tensor computations across spatial
(eg, FPGAs) and vector architectures (eg, GPUs). The state-of-the-art usually address …

FLOWER: A comprehensive dataflow compiler for high-level synthesis

P Amiri, A Pérard-Gayot, R Membarth… - … Conference on Field …, 2021 - ieeexplore.ieee.org
FPGAs have found their way into data centers as accelerator cards, making reconfigurable
computing more accessible for high-performance applications. At the same time, new high …