AutoBridge: Coupling coarse-grained floorplanning and pipelining for high-frequency HLS design on multi-die FPGAs
Despite an increasing adoption of high-level synthesis (HLS) for its design productivity
advantages, there remains a significant gap in the achievable clock frequency between an …
advantages, there remains a significant gap in the achievable clock frequency between an …
Hbm connect: High-performance hls interconnect for fpga hbm
With the recent release of High Bandwidth Memory (HBM) based FPGA boards, developers
can now exploit unprecedented external memory bandwidth. This allows more memory …
can now exploit unprecedented external memory bandwidth. This allows more memory …
RapidStream: parallel physical implementation of FPGA HLS designs
FPGAs require a much longer compilation cycle than conventional computing platforms like
CPUs. In this paper, we shorten the overall compilation time by co-optimizing the HLS …
CPUs. In this paper, we shorten the overall compilation time by co-optimizing the HLS …
OverGen: Improving FPGA usability through domain-specific overlay generation
FPGAs have been proven to be powerful computational accelerators across many types of
workloads. The mainstream programming approach is high level synthesis (HLS), which …
workloads. The mainstream programming approach is high level synthesis (HLS), which …
Sextans: A streaming accelerator for general-purpose sparse-matrix dense-matrix multiplication
Sparse-Matrix Dense-Matrix multiplication (SpMM) is the key operator for a wide range of
applications including scientific computing, graph processing, and deep learning …
applications including scientific computing, graph processing, and deep learning …
Democratizing domain-specific computing
Democratizing Domain-Specific Computing Page 1 GENERAL-PURPOSE COMPUTERS
ARE widely used in our modern society. There were close to 24 million software …
ARE widely used in our modern society. There were close to 24 million software …
Shuhai: A tool for benchmarking high bandwidth memory on FPGAs
FPGAs are starting to incorporate High Bandwidth Memory (HBM) to both reduce the
memory bandwidth bottleneck encountered in some applications and to provide more …
memory bandwidth bottleneck encountered in some applications and to provide more …
FANS: FPGA-accelerated near-storage sorting
Large-scale sorting is always an important yet demanding task for data center applications.
In addition to powerful processing capability, high-performance sorting system requires …
In addition to powerful processing capability, high-performance sorting system requires …
MegIS: High-Performance, Energy-Efficient, and Low-Cost Metagenomic Analysis with In-Storage Processing
Metagenomics, the study of the genome sequences of diverse organisms in a common
environment, has led to significant advances in many fields. Since the species present in a …
environment, has led to significant advances in many fields. Since the species present in a …
Sorting in memristive memory
Sorting data is needed in many application domains. Traditionally, the data is read from
memory and sent to a general-purpose processor or application-specific hardware for …
memory and sent to a general-purpose processor or application-specific hardware for …