Straggler mitigation in distributed matrix multiplication: Fundamental limits and optimal coding

Q Yu, MA Maddah-Ali… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
We consider the problem of massive matrix multiplication, which underlies many data
analytic applications, in a large-scale distributed system comprising a group of worker …

Improved parallel matrix multiplication using Strassen and Urdhvatiryagbhyam method

YRA Bessant, JG Jency, KM Sagayam… - CCF Transactions on …, 2023 - Springer
The current milieu, encourages rapid growth of wireless communication, multimedia
applications, robotics and graphics to have efficient utilization of resources with high …

The LINPACK benchmark: past, present and future

JJ Dongarra, P Luszczek… - … and Computation: practice …, 2003 - Wiley Online Library
This paper describes the LINPACK Benchmark and some of its variations commonly used to
assess the performance of computer systems. Aside from the LINPACK Benchmark suite, the …

SUMMA: Scalable universal matrix multiplication algorithm

RA Van De Geijn, J Watts - Concurrency: Practice and …, 1997 - Wiley Online Library
In the paper we give a straightforward, highly efficient, scalable implementation of common
matrix multiplication operations. The algorithms are much simpler than previously published …

Many-core algorithms for statistical phylogenetics

MA Suchard, A Rambaut - Bioinformatics, 2009 - academic.oup.com
Motivation: Statistical phylogenetics is computationally intensive, resulting in considerable
attention meted on techniques for parallelization. Codon-based models allow for …

Red-blue pebbling revisited: near optimal parallel matrix-matrix multiplication

G Kwasniewski, M Kabić, M Besta… - Proceedings of the …, 2019 - dl.acm.org
We propose COSMA: a parallel matrix-matrix multiplication algorithm that is near
communication-optimal for all combinations of matrix dimensions, processor counts, and …

[HTML][HTML] Fed-mRMR: A lossless federated feature selection method

J Hermo, V Bolón-Canedo, S Ladra - Information Sciences, 2024 - Elsevier
Feature selection has become a mandatory task in data mining, due to the overwhelming
amount of features in Big Data problems. To handle this high-dimensional data and avoid …

Large-scale correlated electronic structure calculations: the RI-MP2 method on parallel computers

DE Bernholdt, RJ Harrison - Chemical Physics Letters, 1996 - Elsevier
The approximate 'resolution of the identity'second-order many-body perturbation theory
method (RI-MP2) recently introduced by Feyereisen, Fitzgerald and Komornicki utilizes a …

A three-dimensional approach to parallel matrix multiplication

RC Agarwal, SM Balle, FG Gustavson… - IBM Journal of …, 1995 - ieeexplore.ieee.org
A three-dimensional (3D) matrix multiplication algorithm for massively parallel processing
systems is presented. The P processors are configured as a “virtual” processing cube with …

Design and implementation of the ScaLAPACK LU, QR, and Cholesky factorization routines

J Choi, JJ Dongarra, LS Ostrouchov… - Scientific …, 1996 - Wiley Online Library
This article discusses the core factorization routines included in the ScaLAPACK library.
These routines allow the factorization and solution of a dense system of linear equations via …