Optimization techniques for GPU programming

P Hijma, S Heldens, A Sclocco… - ACM Computing …, 2023 - dl.acm.org
In the past decade, Graphics Processing Units have played an important role in the field of
high-performance computing and they still advance new fields such as IoT, autonomous …

A survey of accelerating parallel sparse linear algebra

G **ao, C Yin, T Zhou, X Li, Y Chen, K Li - ACM Computing Surveys, 2023 - dl.acm.org
Sparse linear algebra includes the fundamental and important operations in various large-
scale scientific computing and real-world applications. There exists performance bottleneck …

A sparse iteration space transformation framework for sparse tensor algebra

R Senanayake, C Hong, Z Wang, A Wilson… - Proceedings of the …, 2020 - dl.acm.org
We address the problem of optimizing sparse tensor algebra in a compiler and show how to
define standard loop transformations---split, collapse, and reorder---on sparse iteration …

Fast and accurate randomized algorithms for low-rank tensor decompositions

L Ma, E Solomonik - Advances in neural information …, 2021 - proceedings.neurips.cc
Low-rank Tucker and CP tensor decompositions are powerful tools in data analytics. The
widely used alternating least squares (ALS) method, which solves a sequence of over …

Alto: Adaptive linearized storage of sparse tensors

AE Helal, J Laukemann, F Checconi, JJ Tithi… - Proceedings of the …, 2021 - dl.acm.org
The analysis of high-dimensional sparse data is becoming increasingly popular in many
important domains. However, real-world sparse tensors are challenging to process due to …

Sparta: High-performance, element-wise sparse tensor contraction on heterogeneous memory

J Liu, J Ren, R Gioiosa, D Li, J Li - … on Principles and Practice of Parallel …, 2021 - dl.acm.org
Sparse tensor contractions appear commonly in many applications. Efficiently computing a
two sparse tensor product is challenging: It not only inherits the challenges from common …

An efficient mixed-mode representation of sparse tensors

I Nisa, J Li, A Sukumaran-Rajam, PS Rawat… - Proceedings of the …, 2019 - dl.acm.org
The Compressed Sparse Fiber (CSF) representation for sparse tensors is a generalization of
the Compressed Sparse Row (CSR) format for sparse matrices. For a tensor with d modes …

Accelerating sparse MTTKRP for tensor decomposition on FPGA

S Wijeratne, TY Wang, R Kannan… - Proceedings of the 2023 …, 2023 - dl.acm.org
Sparse Matricized Tensor Times Khatri-Rao Product (spMTTKRP) is the most
computationally intensive kernel in sparse tensor decomposition. In this paper, we propose …

A novel parallel algorithm for sparse tensor matrix chain multiplication via tcu-acceleration

H Wang, W Yang, R Hu, R Ouyang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Analysis of multi-dimensional data, especially tensor decomposition, which extracts latent
information, is becoming considerably popular. Although multi-dimensional sparse data is …

Athena: High-performance sparse tensor contraction sequence on heterogeneous memory

J Liu, D Li, R Gioiosa, J Li - Proceedings of the 35th ACM International …, 2021 - dl.acm.org
Sparse tensor contraction sequence has been widely employed in many fields, such as
chemistry and physics. However, how to efficiently implement the sequence faces multiple …