Tileflow: A framework for modeling fusion dataflow via tree-based analysis

S Zheng, S Chen, S Gao, L Jia, G Sun… - Proceedings of the 56th …, 2023 - dl.acm.org
With the increasing size of DNN models and the growing discrepancy between compute
performance and memory bandwidth, fusing multiple layers together to reduce off-chip …

Spade: A flexible and scalable accelerator for spmm and sddmm

G Gerogiannis, S Yesil, D Lenadora, D Cao… - Proceedings of the 50th …, 2023 - dl.acm.org
The widespread use of Sparse Matrix Dense Matrix Multiplication (SpMM) and Sampled
Dense Matrix Dense Matrix Multiplication (SDDMM) kernels makes them candidates for …

Spatula: A hardware accelerator for sparse matrix factorization

A Feldmann, D Sanchez - Proceedings of the 56th Annual IEEE/ACM …, 2023 - dl.acm.org
Solving sparse systems of linear equations is a crucial component in many science and
engineering problems, like simulating physical systems. Sparse matrix factorization …

Trapezoid: A Versatile Accelerator for Dense and Sparse Matrix Multiplications

Y Yang, JS Emer, D Sanchez - 2024 ACM/IEEE 51st Annual …, 2024 - ieeexplore.ieee.org
Accelerating matrix multiplication is crucial to achieve high performance in many application
domains, including neural networks, graph analytics, and scientific computing. These …

LoAS: Fully Temporal-Parallel Dataflow for Dual-Sparse Spiking Neural Networks

R Yin, Y Kim, D Wu, P Panda - 2024 57th IEEE/ACM …, 2024 - ieeexplore.ieee.org
Spiking Neural Networks (SNNs) have gained significant research attention over the past
decade due to their potential for enabling resource-constrained edge devices. While existing …

FEASTA: A Flexible and Efficient Accelerator for Sparse Tensor Algebra in Machine Learning

K Zhong, Z Zhu, G Dai, H Wang, X Yang… - Proceedings of the 29th …, 2024 - dl.acm.org
Recently, sparse tensor algebra (SpTA) plays an increasingly important role in machine
learning. However, due to the unstructured sparsity of SpTA, the general-purpose …

Mentor: A Memory-Efficient Sparse-dense Matrix Multiplication Accelerator Based on Column-Wise Product

X Lu, J Fang, L Peng, C Huang, Z Du, Y Zhao… - ACM Transactions on …, 2024 - dl.acm.org
Sparse-dense matrix multiplication (SpMM) is the performance bottleneck of many high-
performance and deep-learning applications, making it attractive to design specialized …

Vesper: A Versatile Sparse Linear Algebra Accelerator With Configurable Compute Patterns

H **, Z Yue, Z Zhao, Y Du, C Deng… - … on Computer-Aided …, 2024 - ieeexplore.ieee.org
Sparse linear algebra (SLA) operations are fundamental building blocks for many important
applications such as data analytics, graph processing, machine learning, and scientific …

Efficient SpMM Accelerator for Deep Learning: Sparkle and Its Automated Generator

S Xu, J Jiang, **wei Xu, X Qian - ACM Transactions on Reconfigurable …, 2024 - dl.acm.org
Deep learning (DL) technology has made breakthroughs in a wide range of intelligent tasks
such as vision, language, recommendation systems, etc. Sparse matrix multiplication …

Sparm: A Sparse Matrix Multiplication Accelerator Supporting Multiple Dataflows

S Luo, B Wang, Y Shi, X Zhang… - 2024 IEEE 35th …, 2024 - ieeexplore.ieee.org
As the main workload of many scientific and machine learning applications, sparse matrix-
matrix multiplication (spGEMM) has become a hot research field. The current spG EMM …