SparseTIR: Composable abstractions for sparse compilation in deep learning
Sparse tensors are rapidly becoming critical components of modern deep learning
workloads. However, developing high-performance sparse operators can be difficult and …
Sparse supernodal solver using block low-rank compression: Design, performance and analysis
This paper presents two approaches using a Block Low-Rank (BLR) compression technique
to reduce the memory footprint and/or the time-to-solution of the sparse supernodal solver …
A comparison of two effective methods for reordering columns within supernodes
In some recent papers, researchers have found two very good methods for reordering
columns within supernodes in sparse Cholesky factors; these reorderings can be very useful …
Improving predication efficiency through compaction/restoration of SIMD instructions
Vector processors offer a wide range of unexplored opportunities to improve performance
and energy efficiency. However, despite its potential, vector code generation and execution …
Sparsity analysis and optimization for state-space-based simulation of power electronic systems
Computer-aided analysis tool is playing an important role in the design of power electronics
converters. However, with the increase of system scale and complexity, the existing …
Sparse supernodal solver using block low-rank compression
This paper presents two approaches using a Block Low-Rank (BLR) compression technique
to reduce the memory footprint and/or the time-to-solution of the sparse supernodal solver …
Some new techniques to use in serial sparse Cholesky factorization algorithms
We present a new variant of serial right-looking supernodal sparse Cholesky factorization
(RL). Our comparison of RL with the multifrontal method confirms that RL is simpler, slightly …
Optimizing partitioned CSR-based SpGEMM on the Sunway TaihuLight
General sparse matrix-sparse matrix (SpGEMM) multiplication is one of the basic kernels in
a great many applications. Several works focus on various optimizations for SpGEMM. To …
Improved sparsity techniques for solving network equations in transient stability simulations
T Xiao, J Wang, Y Gao, D Gan - IEEE Transactions on Power …, 2018 - ieeexplore.ieee.org
When solving network algebraic equations during power system transient stability
simulations, the nonzero elements in the independent vector and the elements needed in …
Blocking Sparse Matrices to Leverage Dense-Specific Multiplication
Research to accelerate matrix multiplication, pushed by the growing computational
demands of deep learning, has sprouted many efficient architectural solutions, such as …