AutoBridge: Coupling coarse-grained floorplanning and pipelining for high-frequency HLS design on multi-die FPGAs

L Guo, Y Chi, J Wang, J Lau, W Qiao, E Ustun… - The 2021 ACM/SIGDA …, 2021 - dl.acm.org
Despite an increasing adoption of high-level synthesis (HLS) for its design productivity
advantages, there remains a significant gap in the achievable clock frequency between an …

HBM Connect: High-performance HLS interconnect for FPGA HBM

Y Choi, Y Chi, W Qiao, N Samardzic… - The 2021 ACM/SIGDA …, 2021 - dl.acm.org
With the recent release of High Bandwidth Memory (HBM) based FPGA boards, developers
can now exploit unprecedented external memory bandwidth. This allows more memory …

RapidStream: parallel physical implementation of FPGA HLS designs

L Guo, P Maidee, Y Zhou, C Lavin, J Wang… - Proceedings of the …, 2022 - dl.acm.org
FPGAs require a much longer compilation cycle than conventional computing platforms like
CPUs. In this paper, we shorten the overall compilation time by co-optimizing the HLS …

OverGen: Improving FPGA usability through domain-specific overlay generation

S Liu, J Weng, D Kupsh… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org
FPGAs have been proven to be powerful computational accelerators across many types of
workloads. The mainstream programming approach is high level synthesis (HLS), which …

Sextans: A streaming accelerator for general-purpose sparse-matrix dense-matrix multiplication

L Song, Y Chi, A Sohrabizadeh, Y Choi, J Lau… - Proceedings of the …, 2022 - dl.acm.org
Sparse-Matrix Dense-Matrix multiplication (SpMM) is the key operator for a wide range of
applications including scientific computing, graph processing, and deep learning …

Democratizing domain-specific computing

Y Chi, W Qiao, A Sohrabizadeh, J Wang… - Communications of the …, 2022 - dl.acm.org
General-purpose computers are widely used in our modern society. There were close to
24 million software …

Shuhai: A tool for benchmarking high bandwidth memory on FPGAs

H Huang, Z Wang, J Zhang, Z He, C Wu… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
FPGAs are starting to incorporate High Bandwidth Memory (HBM) to both reduce the
memory bandwidth bottleneck encountered in some applications and to provide more …

FANS: FPGA-accelerated near-storage sorting

W Qiao, J Oh, L Guo, MCF Chang… - 2021 IEEE 29th Annual …, 2021 - ieeexplore.ieee.org
Large-scale sorting is always an important yet demanding task for data center applications.
In addition to powerful processing capability, a high-performance sorting system requires …

MegIS: High-Performance, Energy-Efficient, and Low-Cost Metagenomic Analysis with In-Storage Processing

NM Ghiasi, M Sadrosadati, H Mustafa… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
Metagenomics, the study of the genome sequences of diverse organisms in a common
environment, has led to significant advances in many fields. Since the species present in a …

Sorting in memristive memory

MR Alam, MH Najafi, N TaheriNejad - ACM Journal on Emerging …, 2022 - dl.acm.org
Sorting data is needed in many application domains. Traditionally, the data is read from
memory and sent to a general-purpose processor or application-specific hardware for …