Shuhai: Benchmarking high bandwidth memory on FPGAs

Z Wang, H Huang, J Zhang… - 2020 IEEE 28th Annual …, 2020 - ieeexplore.ieee.org
FPGAs are starting to be enhanced with High Bandwidth Memory (HBM) as a way to reduce
the memory bandwidth bottleneck encountered in some applications and to give the FPGA …

Is FPGA useful for hash joins?

X Chen, Y Chen, R Bajaj, J He, B He, WF Wong… - CIDR, 2020 - comp.nus.edu.sg
Benefiting from the fine-grained parallelism and energy efficiency, heterogeneous
computing platforms featuring FPGAs are becoming more and more common in data …

FBLAS: Streaming linear algebra on FPGA

T De Matteis, J de Fine Licht… - … conference for high …, 2020 - ieeexplore.ieee.org
Spatial computing architectures pose an attractive alternative to mitigate control and data
movement overheads typical of load-store architectures. In practice, these devices are rarely …

On-the-fly parallel data shuffling for graph processing on OpenCL-based FPGAs

X Chen, R Bajaj, Y Chen, J He, B He… - … Conference on Field …, 2019 - ieeexplore.ieee.org
Graph processing has attracted much attention recently due to its popularity in many big
data analytic applications. With high performance and energy efficiency, FPGAs can be an …

Accelerating generalized linear models with MLWeaving: A one-size-fits-all system for any-precision learning

Z Wang, K Kara, H Zhang, G Alonso, O Mutlu… - Proceedings of the …, 2019 - dl.acm.org
Learning from the data stored in a database is an important function increasingly available
in relational engines. Methods using lower precision input data are of special interest given …

Boyi: A systematic framework for automatically deciding the right execution model of OpenCL applications on FPGAs

J Jiang, Z Wang, X Liu, J Gómez-Luna… - Proceedings of the …, 2020 - dl.acm.org
FPGA vendors provide OpenCL software development kits for easier programmability, with
the goal of replacing the time-consuming and error-prone register-transfer level (RTL) …

OpenCL for HPC with FPGAs: Case study in molecular electrostatics

C Yang, J Sheng, R Patel, A Sanaullah… - 2017 IEEE High …, 2017 - ieeexplore.ieee.org
FPGAs have emerged as a cost-effective accelerator alternative in clouds and clusters.
Programmability remains a challenge, however, with OpenCL being generally recognized …

ACTS: A Near-Memory FPGA Graph Processing Framework

W Jaiyeoba, N Elyasi, C Choi, K Skadron - Proceedings of the 2023 …, 2023 - dl.acm.org
Despite the high off-chip bandwidth and on-chip parallelism offered by today's near-memory
accelerators, software-based (CPU and GPU) graph processing frameworks still suffer …

Optimized implementation of OpenCL kernels on FPGAs

K Shata, MK Elteir, AA El-Zoghabi - Journal of Systems Architecture, 2019 - Elsevier
Recently, Field-Programmable Gate Array (FPGA) vendors, such as Altera and
Xilinx, released an Open Computing Language Software Development Kit (OpenCL SDK) …

Benchmarking high bandwidth memory on FPGAs

Z Wang, H Huang, J Zhang, G Alonso - arXiv preprint arXiv:2005.04324, 2020 - arxiv.org
FPGAs are starting to be enhanced with High Bandwidth Memory (HBM) as a way to reduce
the memory bandwidth bottleneck encountered in some applications and to give the FPGA …