Towards efficient sparse matrix vector multiplication on real processing-in-memory architectures
Several manufacturers have already started to commercialize near-bank Processing-In-
Memory (PIM) architectures, after decades of research efforts. Near-bank PIM architectures …
Memory (PIM) architectures, after decades of research efforts. Near-bank PIM architectures …
A medium-grained algorithm for sparse tensor factorization
S Smith, G Karypis - 2016 IEEE International Parallel and …, 2016 - ieeexplore.ieee.org
Modeling multi-way data can be accomplished using tensors, which are data structures
indexed along three or more dimensions. Tensors are increasingly used to analyze …
indexed along three or more dimensions. Tensors are increasingly used to analyze …
Hypergraph partitioning for multiple communication cost metrics: Model and methods
We investigate hypergraph partitioning-based methods for efficient parallelization of
communicating tasks. A good partitioning method should divide the load among the …
communicating tasks. A good partitioning method should divide the load among the …
Partitioning models for general medium-grain parallel sparse tensor decomposition
The focus of this article is efficient parallelization of the canonical polyadic decomposition
algorithm utilizing the alternating least squares method for sparse tensors on distributed …
algorithm utilizing the alternating least squares method for sparse tensors on distributed …
[BUCH][B] Parallel Scientific Computation: A Structured Approach Using BSP
RH Bisseling - 2020 - books.google.com
Building upon the wide-ranging success of the first edition, Parallel Scientific Computation
presents a single unified approach to using a range of parallel computers, from a small …
presents a single unified approach to using a range of parallel computers, from a small …
A geometric partitioning method for distributed tomographic reconstruction
Tomography is a powerful technique for 3D imaging of the interior of an object. With the
growing sizes of typical tomographic data sets, the computational requirements for …
growing sizes of typical tomographic data sets, the computational requirements for …
Exact k-way sparse matrix partitioning
EL Jenneskens, RH Bisseling - 2022 IEEE International Parallel …, 2022 - ieeexplore.ieee.org
To minimize the communication in parallel sparse matrix-vector multiplication while
maintaining load balance, we need to partition the sparse matrix optimally into k disjoint …
maintaining load balance, we need to partition the sparse matrix optimally into k disjoint …
An exact algorithm for sparse matrix bipartitioning
The sparse matrix partitioning problem arises when minimizing communication in parallel
sparse matrix–vector multiplications. Since the problem is NP-hard, heuristics are usually …
sparse matrix–vector multiplications. Since the problem is NP-hard, heuristics are usually …
Accelerating Irregular Applications via Efficient Synchronization and Data Access Techniques
C Giannoula - arxiv preprint arxiv:2211.05908, 2022 - arxiv.org
Irregular applications comprise an increasingly important workload domain for many fields,
including bioinformatics, chemistry, physics, social sciences and machine learning …
including bioinformatics, chemistry, physics, social sciences and machine learning …
An improved exact algorithm and an NP-completeness proof for sparse matrix bipartitioning
We investigate sparse matrix bipartitioning–a problem where we minimize the
communication volume in parallel sparse matrix-vector multiplication. We prove, by …
communication volume in parallel sparse matrix-vector multiplication. We prove, by …