Preparing sparse solvers for exascale computing
Sparse solvers provide essential functionality for a wide variety of scientific applications.
Highly parallel sparse solvers are essential for continuing advances in high-fidelity, multi …
Highly parallel sparse solvers are essential for continuing advances in high-fidelity, multi …
Pangulu: A scalable regular two-dimensional block-cyclic sparse direct solver on distributed heterogeneous systems
Sparse direct solvers play a vital role in large-scale high performance computing in science
and engineering. Existing distributed sparse direct methods employ multifrontal/supernodal …
and engineering. Existing distributed sparse direct methods employ multifrontal/supernodal …
Unified Communication Optimization Strategies for Sparse Triangular Solver on CPU and GPU Clusters
This paper presents a unified communication optimization framework for sparse triangular
solve (SpTRSV) algorithms on CPU and GPU clusters. The framework builds upon a 3D …
solve (SpTRSV) algorithms on CPU and GPU clusters. The framework builds upon a 3D …
Newly released capabilities in the distributed-memory SuperLU sparse direct solver
We present the new features available in the recent release of SuperLU_DIST, Version 8.1.
1. SuperLU_DIST is a distributed-memory parallel sparse direct solver. The new features …
1. SuperLU_DIST is a distributed-memory parallel sparse direct solver. The new features …
Efficient block algorithms for parallel sparse triangular solve
The sparse triangular solve (SpTRSV) kernel is an important building block for a number of
linear algebra routines such as sparse direct and iterative solvers. The major challenge of …
linear algebra routines such as sparse direct and iterative solvers. The major challenge of …
CommBench: Micro-Benchmarking Hierarchical Networks with Multi-GPU, Multi-NIC Nodes
M Hidayetoglu, SG De Gonzalo, E Slaughter… - Proceedings of the 38th …, 2024 - dl.acm.org
Modern high-performance computing systems have multiple GPUs and network interface
cards (NICs) per node. The resulting network architectures have multilevel hierarchies of …
cards (NICs) per node. The resulting network architectures have multilevel hierarchies of …
Fast and scalable sparse triangular solver for multi-gpu based hpc architectures
Designing efficient and scalable sparse linear algebra kernels on modern multi-GPU based
HPC systems is a challenging task due to significant irregular memory references and …
HPC systems is a challenging task due to significant irregular memory references and …
Algorithm and Software Overhead: A Theoretical Approach to Performance Portability
V Mele, G Laccetti - International Conference on Parallel Processing and …, 2022 - Springer
In the last years, the portability term has enriched itself with new meanings: research
communities are talking about how to measure the degree to which an application (or …
communities are talking about how to measure the degree to which an application (or …
Brief Announcement: Communication Optimal Sparse LU Factorization for Planar Matrices
We introduce a new parallel algorithm for solving sparse LU factorization of planar matrices,
which commonly arise in the finite element method for 2D PDEs. Existing scalable methods …
which commonly arise in the finite element method for 2D PDEs. Existing scalable methods …
[PDF][PDF] Refereed Journals 1. S. Jutamulia, G. Storti and X. Li,“Expert Systems Based on LCTV AND/OR Logic” Optics and Laser Technology, vol. 21, No. 6, 392-394 …
XS Li - Parallel Computing, 2003 - portal.nersc.gov
**aoye Sherry Li Refereed Journals 1. S. Jutamulia, G. Storti and X. Li, “Expert Systems
Based on LCTV AND/OR Logic” Optics Page 1 **aoye Sherry Li Last Update: 11/14/2020 …
Based on LCTV AND/OR Logic” Optics Page 1 **aoye Sherry Li Last Update: 11/14/2020 …