The sparse polyhedral framework: Composing compiler-generated inspector-executor code
Irregular applications such as big graph analysis, material simulations, molecular dynamics
simulations, and finite element analysis have performance problems due to their use of …
simulations, and finite element analysis have performance problems due to their use of …
Outerspace: An outer product based sparse matrix multiplication accelerator
Sparse matrices are widely used in graph and data analytics, machine learning, engineering
and scientific applications. This paper describes and analyzes OuterSPACE, an accelerator …
and scientific applications. This paper describes and analyzes OuterSPACE, an accelerator …
Model-driven autotuning of sparse matrix-vector multiply on GPUs
We present a performance model-driven framework for automated performance tuning
(autotuning) of sparse matrix-vector multiply (SpMV) on systems accelerated by graphics …
(autotuning) of sparse matrix-vector multiply (SpMV) on systems accelerated by graphics …
Efficient sparse matrix-vector multiplication on x86-based many-core processors
Sparse matrix-vector multiplication (SpMV) is an important kernel in many scientific
applications and is known to be memory bandwidth limited. On modern processors with …
applications and is known to be memory bandwidth limited. On modern processors with …
OSKI: A library of automatically tuned sparse matrix kernels
Abstract The Optimized Sparse Kernel Interface (OSKI) is a collection of low-level primitives
that provide automatically tuned computational kernels on sparse matrices, for use by solver …
that provide automatically tuned computational kernels on sparse matrices, for use by solver …
Sparsep: Towards efficient sparse matrix vector multiplication on real processing-in-memory architectures
Several manufacturers have already started to commercialize near-bank Processing-In-
Memory (PIM) architectures, after decades of research efforts. Near-bank PIM architectures …
Memory (PIM) architectures, after decades of research efforts. Near-bank PIM architectures …
TileSpGEMM: A tiled algorithm for parallel sparse general matrix-matrix multiplication on GPUs
Sparse general matrix-matrix multiplication (SpGEMM) is one of the most fundamental
building blocks in sparse linear solvers, graph processing frameworks and machine learning …
building blocks in sparse linear solvers, graph processing frameworks and machine learning …
Exposing fine-grained parallelism in algebraic multigrid methods
Algebraic multigrid methods for large, sparse linear systems are a necessity in many
computational simulations, yet parallel algorithms for such solvers are generally …
computational simulations, yet parallel algorithms for such solvers are generally …
Towards efficient sparse matrix vector multiplication on real processing-in-memory architectures
Several manufacturers have already started to commercialize near-bank Processing-In-
Memory (PIM) architectures, after decades of research efforts. Near-bank PIM architectures …
Memory (PIM) architectures, after decades of research efforts. Near-bank PIM architectures …
Smash: Co-designing software compression and hardware-accelerated indexing for efficient sparse matrix operations
Important workloads, such as machine learning and graph analytics applications, heavily
involve sparse linear algebra operations. These operations use sparse matrix compression …
involve sparse linear algebra operations. These operations use sparse matrix compression …