Optimization techniques for GPU programming
In the past decade, Graphics Processing Units have played an important role in the field of
high-performance computing and they still advance new fields such as IoT, autonomous …
high-performance computing and they still advance new fields such as IoT, autonomous …
A survey of accelerating parallel sparse linear algebra
Sparse linear algebra includes the fundamental and important operations in various large-
scale scientific computing and real-world applications. There exists performance bottleneck …
scale scientific computing and real-world applications. There exists performance bottleneck …
A sparse iteration space transformation framework for sparse tensor algebra
We address the problem of optimizing sparse tensor algebra in a compiler and show how to
define standard loop transformations---split, collapse, and reorder---on sparse iteration …
define standard loop transformations---split, collapse, and reorder---on sparse iteration …
Fast and accurate randomized algorithms for low-rank tensor decompositions
Low-rank Tucker and CP tensor decompositions are powerful tools in data analytics. The
widely used alternating least squares (ALS) method, which solves a sequence of over …
widely used alternating least squares (ALS) method, which solves a sequence of over …
Alto: Adaptive linearized storage of sparse tensors
The analysis of high-dimensional sparse data is becoming increasingly popular in many
important domains. However, real-world sparse tensors are challenging to process due to …
important domains. However, real-world sparse tensors are challenging to process due to …
Sparta: High-performance, element-wise sparse tensor contraction on heterogeneous memory
Sparse tensor contractions appear commonly in many applications. Efficiently computing a
two sparse tensor product is challenging: It not only inherits the challenges from common …
two sparse tensor product is challenging: It not only inherits the challenges from common …
An efficient mixed-mode representation of sparse tensors
The Compressed Sparse Fiber (CSF) representation for sparse tensors is a generalization of
the Compressed Sparse Row (CSR) format for sparse matrices. For a tensor with d modes …
the Compressed Sparse Row (CSR) format for sparse matrices. For a tensor with d modes …
Accelerating sparse MTTKRP for tensor decomposition on FPGA
Sparse Matricized Tensor Times Khatri-Rao Product (spMTTKRP) is the most
computationally intensive kernel in sparse tensor decomposition. In this paper, we propose …
computationally intensive kernel in sparse tensor decomposition. In this paper, we propose …
A novel parallel algorithm for sparse tensor matrix chain multiplication via tcu-acceleration
H Wang, W Yang, R Hu, R Ouyang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Analysis of multi-dimensional data, especially tensor decomposition, which extracts latent
information, is becoming considerably popular. Although multi-dimensional sparse data is …
information, is becoming considerably popular. Although multi-dimensional sparse data is …
Athena: High-performance sparse tensor contraction sequence on heterogeneous memory
Sparse tensor contraction sequence has been widely employed in many fields, such as
chemistry and physics. However, how to efficiently implement the sequence faces multiple …
chemistry and physics. However, how to efficiently implement the sequence faces multiple …