A survey of direct methods for sparse linear systems

TA Davis, S Rajamanickam, WM Sid-Lakhdar - Acta Numerica, 2016 - cambridge.org
Wilkinson defined a sparse matrix as one with enough zeros that it pays to take advantage of
them. 1 This informal yet practical definition captures the essence of the goal of direct …

Mosaic: Processing a trillion-edge graph on a single machine

S Maass, C Min, S Kashyap, W Kang… - Proceedings of the …, 2017 - dl.acm.org
Processing a one trillion-edge graph has recently been demonstrated by distributed graph
engines running on clusters of tens to hundreds of nodes. In this paper, we employ a single …

Graphin: An online high performance incremental graph processing framework

D Sengupta, N Sundaram, X Zhu, TL Willke… - Euro-Par 2016: Parallel …, 2016 - Springer
The massive explosion in social networks has led to a significant growth in graph analytics
and specifically in dynamic, time-varying graphs. Most prior work processes dynamic graphs …

Parallel graph coloring for manycore architectures

M Deveci, EG Boman, KD Devine… - 2016 IEEE …, 2016 - ieeexplore.ieee.org
Graph algorithms are challenging to parallelize on manycore architectures due to complex
data dependencies and irregular memory access. We consider the well studied problem of …

A case study of complex graph analysis in distributed memory: Implementation and optimization

GM Slota, S Rajamanickam… - 2016 IEEE International …, 2016 - ieeexplore.ieee.org
In recent years, a large number of graph processing frameworks have been introduced, with
their goal to simplify analysis of real-world graphs on commodity hardware. Additionally, the …

Embedded ensemble propagation for improving performance, portability, and scalability of uncertainty quantification on emerging computational architectures

E Phipps, M D'Elia, HC Edwards, M Hoemmen… - SIAM Journal on …, 2017 - SIAM
Quantifying simulation uncertainties is a critical component of rigorous predictive simulation.
A key component of this is forward propagation of uncertainties in simulation input data to …

Basker: Parallel sparse LU factorization utilizing hierarchical parallelism and data layouts

JD Booth, ND Ellingwood, HK Thornquist… - Parallel Computing, 2017 - Elsevier
Transient simulation in circuit simulation tools, such as SPICE and Xyce, depend on
scalable and robust sparse LU factorizations for efficient numerical simulation of circuits and …

CCF: An efficient SpMV storage format for AVX512 platforms

M Almasri, W Abu-Sufah - Parallel Computing, 2020 - Elsevier
We present a sparse matrix vector multiplication (SpMV) kernel that uses a novel sparse
matrix storage format and delivers superior performance for unstructured matrices on Intel …

Sparse Matrix-Vector Multiplication Based on Online Arithmetic

SM Cherati, G Jaberipur, L Sousa - IEEE Access, 2024 - ieeexplore.ieee.org
Online arithmetic, where computations are performed from the most significant digit first, has
shown benefits in improving throughput and latency within high-performance computing …

Orchestrating parallel detection of strongly connected components on GPUs

X Chen, C Chen, J Shen, J Fang, T Tang, C Yang… - Parallel Computing, 2018 - Elsevier
Detecting strongly connected components (SCC) is a practical graph analytics algorithm
widely used in many application domains. To accelerate SCC detection, parallel algorithms …