Towards efficient sparse matrix vector multiplication on real processing-in-memory architectures

C Giannoula, I Fernandez, J Gómez-Luna… - ACM SIGMETRICS …, 2022 - dl.acm.org
Several manufacturers have already started to commercialize near-bank Processing-In-
Memory (PIM) architectures, after decades of research efforts. Near-bank PIM architectures …

A medium-grained algorithm for sparse tensor factorization

S Smith, G Karypis - 2016 IEEE International Parallel and …, 2016 - ieeexplore.ieee.org
Modeling multi-way data can be accomplished using tensors, which are data structures
indexed along three or more dimensions. Tensors are increasingly used to analyze …

Hypergraph partitioning for multiple communication cost metrics: Model and methods

M Deveci, K Kaya, B Uçar, ÜV Çatalyürek - Journal of Parallel and …, 2015 - Elsevier
We investigate hypergraph partitioning-based methods for efficient parallelization of
communicating tasks. A good partitioning method should divide the load among the …

Partitioning models for general medium-grain parallel sparse tensor decomposition

MO Karsavuran, S Acer… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
The focus of this article is efficient parallelization of the canonical polyadic decomposition
algorithm utilizing the alternating least squares method for sparse tensors on distributed …

[BUCH][B] Parallel Scientific Computation: A Structured Approach Using BSP

RH Bisseling - 2020 - books.google.com
Building upon the wide-ranging success of the first edition, Parallel Scientific Computation
presents a single unified approach to using a range of parallel computers, from a small …

A geometric partitioning method for distributed tomographic reconstruction

JW Buurlage, RH Bisseling, KJ Batenburg - Parallel Computing, 2019 - Elsevier
Tomography is a powerful technique for 3D imaging of the interior of an object. With the
growing sizes of typical tomographic data sets, the computational requirements for …

Exact k-way sparse matrix partitioning

EL Jenneskens, RH Bisseling - 2022 IEEE International Parallel …, 2022 - ieeexplore.ieee.org
To minimize the communication in parallel sparse matrix-vector multiplication while
maintaining load balance, we need to partition the sparse matrix optimally into k disjoint …

An exact algorithm for sparse matrix bipartitioning

DM Pelt, RH Bisseling - Journal of Parallel and Distributed Computing, 2015 - Elsevier
The sparse matrix partitioning problem arises when minimizing communication in parallel
sparse matrix–vector multiplications. Since the problem is NP-hard, heuristics are usually …

Accelerating Irregular Applications via Efficient Synchronization and Data Access Techniques

C Giannoula - arxiv preprint arxiv:2211.05908, 2022 - arxiv.org
Irregular applications comprise an increasingly important workload domain for many fields,
including bioinformatics, chemistry, physics, social sciences and machine learning …

An improved exact algorithm and an NP-completeness proof for sparse matrix bipartitioning

TE Knigge, RH Bisseling - Parallel Computing, 2020 - Elsevier
We investigate sparse matrix bipartitioning–a problem where we minimize the
communication volume in parallel sparse matrix-vector multiplication. We prove, by …