Preparing sparse solvers for exascale computing

H Anzt, E Boman, R Falgout… - … of the Royal …, 2020 - royalsocietypublishing.org
Sparse solvers provide essential functionality for a wide variety of scientific applications.
Highly parallel sparse solvers are essential for continuing advances in high-fidelity, multi …

PanguLU: A scalable regular two-dimensional block-cyclic sparse direct solver on distributed heterogeneous systems

X Fu, B Zhang, T Wang, W Li, Y Lu, E Yi… - Proceedings of the …, 2023 - dl.acm.org
Sparse direct solvers play a vital role in large-scale high performance computing in science
and engineering. Existing distributed sparse direct methods employ multifrontal/supernodal …

Unified Communication Optimization Strategies for Sparse Triangular Solver on CPU and GPU Clusters

Y Liu, N Ding, P Sao, S Williams, XS Li - Proceedings of the International …, 2023 - dl.acm.org
This paper presents a unified communication optimization framework for sparse triangular
solve (SpTRSV) algorithms on CPU and GPU clusters. The framework builds upon a 3D …

Newly released capabilities in the distributed-memory SuperLU sparse direct solver

XS Li, P Lin, Y Liu, P Sao - ACM Transactions on Mathematical Software, 2023 - dl.acm.org
We present the new features available in the recent release of SuperLU_DIST, Version 8.1.1.
SuperLU_DIST is a distributed-memory parallel sparse direct solver. The new features …

Efficient block algorithms for parallel sparse triangular solve

Z Lu, Y Niu, W Liu - Proceedings of the 49th International Conference on …, 2020 - dl.acm.org
The sparse triangular solve (SpTRSV) kernel is an important building block for a number of
linear algebra routines such as sparse direct and iterative solvers. The major challenge of …

CommBench: Micro-Benchmarking Hierarchical Networks with Multi-GPU, Multi-NIC Nodes

M Hidayetoglu, SG De Gonzalo, E Slaughter… - Proceedings of the 38th …, 2024 - dl.acm.org
Modern high-performance computing systems have multiple GPUs and network interface
cards (NICs) per node. The resulting network architectures have multilevel hierarchies of …

Fast and scalable sparse triangular solver for multi-GPU based HPC architectures

C Xie, J Chen, J Firoz, J Li, SL Song, K Barker… - Proceedings of the 50th …, 2021 - dl.acm.org
Designing efficient and scalable sparse linear algebra kernels on modern multi-GPU based
HPC systems is a challenging task due to significant irregular memory references and …

Algorithm and Software Overhead: A Theoretical Approach to Performance Portability

V Mele, G Laccetti - International Conference on Parallel Processing and …, 2022 - Springer
In the last years, the portability term has enriched itself with new meanings: research
communities are talking about how to measure the degree to which an application (or …

Brief Announcement: Communication Optimal Sparse LU Factorization for Planar Matrices

P Sao, XS Li - Proceedings of the 35th ACM Symposium on …, 2023 - dl.acm.org
We introduce a new parallel algorithm for solving sparse LU factorization of planar matrices,
which commonly arise in the finite element method for 2D PDEs. Existing scalable methods …

[PDF] Refereed Journals 1. S. Jutamulia, G. Storti and X. Li, “Expert Systems Based on LCTV AND/OR Logic” Optics and Laser Technology, vol. 21, No. 6, 392-394 …

XS Li - Parallel Computing, 2003 - portal.nersc.gov
Xiaoye Sherry Li Refereed Journals 1. S. Jutamulia, G. Storti and X. Li, “Expert Systems
Based on LCTV AND/OR Logic” Optics Page 1 Xiaoye Sherry Li Last Update: 11/14/2020 …