Google Académico

MM Strout, M Hall, C Olschanowsky - Proceedings of the IEEE, 2018 - ieeexplore.ieee.org

Irregular applications such as big graph analysis, material simulations, molecular dynamics
simulations, and finite element analysis have performance problems due to their use of …

Guardar Citar Citado por 90 Artículos relacionados Las 3 versiones

[Free GPT-4]
[DeepSeek]

[PDF] umich.edu

Outerspace: An outer product based sparse matrix multiplication accelerator

S Pal, J Beaumont, DH Park… - … Symposium on High …, 2018 - ieeexplore.ieee.org

Sparse matrices are widely used in graph and data analytics, machine learning, engineering
and scientific applications. This paper describes and analyzes OuterSPACE, an accelerator …

Guardar Citar Citado por 313 Artículos relacionados Las 6 versiones

[Free GPT-4]
[DeepSeek]

[PDF] academia.edu

Model-driven autotuning of sparse matrix-vector multiply on GPUs

JW Choi, A Singh, RW Vuduc - ACM sigplan notices, 2010 - dl.acm.org

We present a performance model-driven framework for automated performance tuning
(autotuning) of sparse matrix-vector multiply (SpMV) on systems accelerated by graphics …

Guardar Citar Citado por 580 Artículos relacionados Las 15 versiones

[Free GPT-4]
[DeepSeek]

[PDF] researchgate.net

Efficient sparse matrix-vector multiplication on x86-based many-core processors

X Liu, M Smelyanskiy, E Chow, P Dubey - Proceedings of the 27th …, 2013 - dl.acm.org

Sparse matrix-vector multiplication (SpMV) is an important kernel in many scientific
applications and is known to be memory bandwidth limited. On modern processors with …

Guardar Citar Citado por 343 Artículos relacionados Las 9 versiones

[Free GPT-4]
[DeepSeek]

[PDF] iop.org

OSKI: A library of automatically tuned sparse matrix kernels

R Vuduc, JW Demmel, KA Yelick - Journal of Physics …, 2005 - iopscience.iop.org

Abstract The Optimized Sparse Kernel Interface (OSKI) is a collection of low-level primitives
that provide automatically tuned computational kernels on sparse matrices, for use by solver …

Guardar Citar Citado por 728 Artículos relacionados Las 14 versiones Búsqueda de bibliotecas

[Free GPT-4]
[DeepSeek]

[PDF] ethz.ch

Sparsep: Towards efficient sparse matrix vector multiplication on real processing-in-memory architectures

C Giannoula, I Fernandez, JG Luna, N Koziris… - Proceedings of the …, 2022 - dl.acm.org

Several manufacturers have already started to commercialize near-bank Processing-In-
Memory (PIM) architectures, after decades of research efforts. Near-bank PIM architectures …

Guardar Citar Citado por 66 Artículos relacionados Las 3 versiones

[Free GPT-4]
[DeepSeek]

[PDF] ssslab.cn

TileSpGEMM: A tiled algorithm for parallel sparse general matrix-matrix multiplication on GPUs

Y Niu, Z Lu, H Ji, S Song, Z **, W Liu - Proceedings of the 27th ACM …, 2022 - dl.acm.org

Sparse general matrix-matrix multiplication (SpGEMM) is one of the most fundamental
building blocks in sparse linear solvers, graph processing frameworks and machine learning …

Guardar Citar Citado por 50 Artículos relacionados Las 4 versiones

[Free GPT-4]
[DeepSeek]

[PDF] psu.edu

Exposing fine-grained parallelism in algebraic multigrid methods

N Bell, S Dalton, LN Olson - SIAM Journal on Scientific Computing, 2012 - SIAM

Algebraic multigrid methods for large, sparse linear systems are a necessity in many
computational simulations, yet parallel algorithms for such solvers are generally …

Guardar Citar Citado por 278 Artículos relacionados Las 15 versiones

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Towards efficient sparse matrix vector multiplication on real processing-in-memory architectures

C Giannoula, I Fernandez, J Gómez-Luna… - ACM SIGMETRICS …, 2022 - dl.acm.org

Several manufacturers have already started to commercialize near-bank Processing-In-
Memory (PIM) architectures, after decades of research efforts. Near-bank PIM architectures …

Guardar Citar Citado por 45 Artículos relacionados Las 10 versiones

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Smash: Co-designing software compression and hardware-accelerated indexing for efficient sparse matrix operations

K Kanellopoulos, N Vijaykumar, C Giannoula… - Proceedings of the …, 2019 - dl.acm.org

Important workloads, such as machine learning and graph analytics applications, heavily
involve sparse linear algebra operations. These operations use sparse matrix compression …

Guardar Citar Citado por 117 Artículos relacionados Las 6 versiones

Citar

Búsqueda avanzada

Guardado en Mi biblioteca

The sparse polyhedral framework: Composing compiler-generated inspector-executor code

Outerspace: An outer product based sparse matrix multiplication accelerator

Model-driven autotuning of sparse matrix-vector multiply on GPUs

Efficient sparse matrix-vector multiplication on x86-based many-core processors

OSKI: A library of automatically tuned sparse matrix kernels

Sparsep: Towards efficient sparse matrix vector multiplication on real processing-in-memory architectures

TileSpGEMM: A tiled algorithm for parallel sparse general matrix-matrix multiplication on GPUs

Exposing fine-grained parallelism in algebraic multigrid methods

Towards efficient sparse matrix vector multiplication on real processing-in-memory architectures

Smash: Co-designing software compression and hardware-accelerated indexing for efficient sparse matrix operations