Tileflow: A framework for modeling fusion dataflow via tree-based analysis
With the increasing size of DNN models and the growing discrepancy between compute
performance and memory bandwidth, fusing multiple layers together to reduce off-chip …
performance and memory bandwidth, fusing multiple layers together to reduce off-chip …
Spade: A flexible and scalable accelerator for spmm and sddmm
The widespread use of Sparse Matrix Dense Matrix Multiplication (SpMM) and Sampled
Dense Matrix Dense Matrix Multiplication (SDDMM) kernels makes them candidates for …
Dense Matrix Dense Matrix Multiplication (SDDMM) kernels makes them candidates for …
Spatula: A hardware accelerator for sparse matrix factorization
Solving sparse systems of linear equations is a crucial component in many science and
engineering problems, like simulating physical systems. Sparse matrix factorization …
engineering problems, like simulating physical systems. Sparse matrix factorization …
Trapezoid: A Versatile Accelerator for Dense and Sparse Matrix Multiplications
Accelerating matrix multiplication is crucial to achieve high performance in many application
domains, including neural networks, graph analytics, and scientific computing. These …
domains, including neural networks, graph analytics, and scientific computing. These …
LoAS: Fully Temporal-Parallel Dataflow for Dual-Sparse Spiking Neural Networks
Spiking Neural Networks (SNNs) have gained significant research attention over the past
decade due to their potential for enabling resource-constrained edge devices. While existing …
decade due to their potential for enabling resource-constrained edge devices. While existing …
FEASTA: A Flexible and Efficient Accelerator for Sparse Tensor Algebra in Machine Learning
Recently, sparse tensor algebra (SpTA) plays an increasingly important role in machine
learning. However, due to the unstructured sparsity of SpTA, the general-purpose …
learning. However, due to the unstructured sparsity of SpTA, the general-purpose …
Mentor: A Memory-Efficient Sparse-dense Matrix Multiplication Accelerator Based on Column-Wise Product
Sparse-dense matrix multiplication (SpMM) is the performance bottleneck of many high-
performance and deep-learning applications, making it attractive to design specialized …
performance and deep-learning applications, making it attractive to design specialized …
Vesper: A Versatile Sparse Linear Algebra Accelerator With Configurable Compute Patterns
Sparse linear algebra (SLA) operations are fundamental building blocks for many important
applications such as data analytics, graph processing, machine learning, and scientific …
applications such as data analytics, graph processing, machine learning, and scientific …
Efficient SpMM Accelerator for Deep Learning: Sparkle and Its Automated Generator
S Xu, J Jiang, **wei Xu, X Qian - ACM Transactions on Reconfigurable …, 2024 - dl.acm.org
Deep learning (DL) technology has made breakthroughs in a wide range of intelligent tasks
such as vision, language, recommendation systems, etc. Sparse matrix multiplication …
such as vision, language, recommendation systems, etc. Sparse matrix multiplication …
Sparm: A Sparse Matrix Multiplication Accelerator Supporting Multiple Dataflows
S Luo, B Wang, Y Shi, X Zhang… - 2024 IEEE 35th …, 2024 - ieeexplore.ieee.org
As the main workload of many scientific and machine learning applications, sparse matrix-
matrix multiplication (spGEMM) has become a hot research field. The current spG EMM …
matrix multiplication (spGEMM) has become a hot research field. The current spG EMM …