Communication lower bounds and optimal algorithms for numerical linear algebra

G Ballard, E Carson, J Demmel, M Hoemmen… - Acta Numerica, 2014 - cambridge.org
The traditional metric for the efficiency of a numerical algorithm has been the number of
arithmetic operations it performs. Technological trends have long been reducing the time to …

[BOOK][B] Parallel Computers 2: architecture, programming and algorithms

RW Hockney, CR Jesshope - 2019 - taylorfrancis.com
Since the publication of the first edition, parallel computing technology has gained
considerable momentum. A large proportion of this has come from the improvement in VLSI …

[BOOK][B] Introduction to parallel and vector solution of linear systems

JM Ortega - 2013 - books.google.com
Although the origins of parallel computing go back to the last century, it was only in the
1970s that parallel and vector computers became available to the scientific community. The …

Optimum broadcasting and personalized communication in hypercubes

SL Johnsson, CT Ho - IEEE Transactions on computers, 1989 - ieeexplore.ieee.org
Four different communication problems are addressed in Boolean n-cube configured
multiprocessors: (1) one-to-all broadcasting: distribution of common data from a single …

Solution of partial differential equations on vector and parallel computers

JM Ortega, RG Voigt - SIAM review, 1985 - SIAM
In this work we review the present status of numerical methods for partial differential
equations on vector and parallel computers. A discussion of the relevant aspects of these …

Multiprocessor FFTs

PN Swarztrauber - Parallel computing, 1987 - Elsevier
Several multiprocessor FFTs are developed in this paper for both vector multiprocessors
with shared memory and the hypercube. Two FFTs for vector multiprocessors are given that …

Parallel algorithms for dense linear algebra computations

KA Gallivan, RJ Plemmons, AH Sameh - SIAM review, 1990 - SIAM
Scientific and engineering research is becoming increasingly dependent upon the
development and implementation of efficient parallel algorithms on modern high …

Hypernet: A communication-efficient architecture for constructing massively parallel computers

K Hwang, J Ghosh - IEEE Transactions on Computers, 1987 - ieeexplore.ieee.org
A new class of modular networks is proposed for hierarchically constructing massively
parallel computer systems for distributed supercomputing and AI applications. These …

Data communication in parallel architectures

Y Saad, MH Schultz - Parallel Computing, 1989 - Elsevier
In this paper we consider different methods for exchanging data among processors in
parallel computers. The most common data exchange operations in parallel numerical …

Impact of hierarchical memory systems on linear algebra algorithm design

K Gallivan, W Jalby, U Meier… - … International Journal of …, 1988 - journals.sagepub.com
Linear algebra algorithms based on the BLAS or extended BLAS do not achieve high
performance on multivector processors with a hierarchical memory system because of a …