A survey of direct methods for sparse linear systems
Wilkinson defined a sparse matrix as one with enough zeros that it pays to take advantage of
them. 1 This informal yet practical definition captures the essence of the goal of direct …
them. 1 This informal yet practical definition captures the essence of the goal of direct …
Mosaic: Processing a trillion-edge graph on a single machine
Processing a one trillion-edge graph has recently been demonstrated by distributed graph
engines running on clusters of tens to hundreds of nodes. In this paper, we employ a single …
engines running on clusters of tens to hundreds of nodes. In this paper, we employ a single …
Graphin: An online high performance incremental graph processing framework
The massive explosion in social networks has led to a significant growth in graph analytics
and specifically in dynamic, time-varying graphs. Most prior work processes dynamic graphs …
and specifically in dynamic, time-varying graphs. Most prior work processes dynamic graphs …
Parallel graph coloring for manycore architectures
Graph algorithms are challenging to parallelize on manycore architectures due to complex
data dependencies and irregular memory access. We consider the well studied problem of …
data dependencies and irregular memory access. We consider the well studied problem of …
A case study of complex graph analysis in distributed memory: Implementation and optimization
In recent years, a large number of graph processing frameworks have been introduced, with
their goal to simplify analysis of real-world graphs on commodity hardware. Additionally, the …
their goal to simplify analysis of real-world graphs on commodity hardware. Additionally, the …
Embedded ensemble propagation for improving performance, portability, and scalability of uncertainty quantification on emerging computational architectures
Quantifying simulation uncertainties is a critical component of rigorous predictive simulation.
A key component of this is forward propagation of uncertainties in simulation input data to …
A key component of this is forward propagation of uncertainties in simulation input data to …
Basker: Parallel sparse LU factorization utilizing hierarchical parallelism and data layouts
JD Booth, ND Ellingwood, HK Thornquist… - Parallel Computing, 2017 - Elsevier
Transient simulation in circuit simulation tools, such as SPICE and Xyce, depend on
scalable and robust sparse LU factorizations for efficient numerical simulation of circuits and …
scalable and robust sparse LU factorizations for efficient numerical simulation of circuits and …
CCF: An efficient SpMV storage format for AVX512 platforms
We present a sparse matrix vector multiplication (SpMV) kernel that uses a novel sparse
matrix storage format and delivers superior performance for unstructured matrices on Intel …
matrix storage format and delivers superior performance for unstructured matrices on Intel …
Sparse Matrix-Vector Multiplication Based on Online Arithmetic
Online arithmetic, where computations are performed from the most significant digit first, has
shown benefits in improving throughput and latency within high-performance computing …
shown benefits in improving throughput and latency within high-performance computing …
Orchestrating parallel detection of strongly connected components on GPUs
Detecting strongly connected components (SCC) is a practical graph analytics algorithm
widely used in many application domains. To accelerate SCC detection, parallel algorithms …
widely used in many application domains. To accelerate SCC detection, parallel algorithms …