A survey of accelerating parallel sparse linear algebra

G **ao, C Yin, T Zhou, X Li, Y Chen, K Li - ACM Computing Surveys, 2023 - dl.acm.org
Sparse linear algebra includes the fundamental and important operations in various large-
scale scientific computing and real-world applications. There exists performance bottleneck …

A streaming dataflow engine for sparse matrix-vector multiplication using high-level synthesis

M Hosseinabady… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Using high-level synthesis techniques, this paper proposes an adaptable high-performance
streaming dataflow engine for sparse matrix dense vector multiplication (SpMV) suitable for …

A domain-specific architecture for accelerating sparse matrix vector multiplication on fpgas

AK Jain, H Omidian, H Fraisse… - … conference on field …, 2020 - ieeexplore.ieee.org
FPGAs allow custom memory hierarchy and flexible data movement with highly fine-grained
control. These capabilities are critical for building high performance and energy efficient …

An efficient gustavson-based sparse matrix–matrix multiplication accelerator on embedded FPGAs

S Li, S Huai, W Liu - … Transactions on Computer-Aided Design of …, 2023 - ieeexplore.ieee.org
Sparse matrix–matrix multiplication (SpMM) is an important kernel in multiple areas, eg, data
analytics and machine learning. Due to the low on-chip memory requirement, the consistent …

Optimized data reuse via reordering for sparse matrix-vector multiplication on fpgas

S Li, D Liu, W Liu - 2021 IEEE/ACM International Conference …, 2021 - ieeexplore.ieee.org
Sparse matrix-vector multiplication (SpMV) is of paramount importance in both scientific and
engineering applications. The main workload of SpMV is multiplications between randomly …

Efficient FPGA-Based Sparse Matrix–Vector Multiplication With Data Reuse-Aware Compression

S Li, D Liu, WLD Liu - … on Computer-Aided Design of Integrated …, 2023 - ieeexplore.ieee.org
Sparse matrix–vector multiplication (SpMV) on FPGAs has gained much attention. The
performance of SpMV is mainly determined by the number of multiplications between …

[HTML][HTML] Sparse and dense matrix multiplication hardware for heterogeneous multi-precision neural networks

J Nunez-Yanez, M Hosseinabady - Array, 2021 - Elsevier
In this paper, we present hardware accelerators created with high-level synthesis
techniques for sparse and dense matrix multiplication operations. The cores can operate …

Redesk: A reconfigurable dataflow engine for sparse kernels on heterogeneous platforms

K Lu, Z Li, L Liu, J Wang, S Yin… - 2019 IEEE/ACM …, 2019 - ieeexplore.ieee.org
Sparse Matrix-Vector Multiplication (SpMV) is the most important sparse linear algebra
kernel in both scientific and engineering applications. Due to its irregular control flow and …

Entropy Maximization in Sparse Matrix by Vector Multiplication ()

P D'Alberto, A Jain, I Bustany, H Fraisse… - arxiv preprint arxiv …, 2023 - arxiv.org
The peak performance of any SpMV depends primarily on the available memory bandwidth
and its effective use. GPUs, ASICs, and new FPGAs have higher and higher bandwidth; …

Power and Delay Efficient Approximate Sparse Matrix Vector Multiplication on FPGA using HLS

AC Shaji, Z Aizaz, K Khare - 2024 3rd International Conference …, 2024 - ieeexplore.ieee.org
High-Level Synthesis (HLS) tools have the capability of creating register-transfer level from
high-level specifications by not degrading the performance. OpenCL framework uses HLS …