[KNIHA][B] ScaLAPACK users' guide

LS Blackford, J Choi, A Cleary, E D'Azevedo, J Demmel… - 1997 - SIAM
Following the initial release of LAPACK and the emerging importance of distributed memory
computing, work began on adapting LAPACK to distributed-memory architectures. Since …

LogP: Towards a realistic model of parallel computation

D Culler, R Karp, D Patterson, A Sahay… - Proceedings of the …, 1993 - dl.acm.org
A vast body of theoretical research has focused either on overly simplistic models of parallel
computation, notably the PRAM, or overly specific models that have few representatives in …

ScaLAPACK: A scalable linear algebra library for distributed memory concurrent computers

J Choi, JJ Dongarra, R Pozo… - The Fourth Symposium on …, 1992 - computer.org
Cotton, popularly known as White Gold has been an important commercial crop of National
significance due to the immense influence of its rural economy. Transfer of technology to …

Elemental: A new framework for distributed memory dense matrix computations

J Poulson, B Marker, RA Van de Geijn… - ACM Transactions on …, 2013 - dl.acm.org
Parallelizing dense matrix computations to distributed memory architectures is a well-
studied subject and generally considered to be among the best understood domains of …

ScaLAPACK: A portable linear algebra library for distributed memory computers—Design issues and performance

J Choi, J Demmel, I Dhillon, J Dongarra… - Computer Physics …, 1996 - Elsevier
This paper outlines the content and performance of ScaLAPACK, a collection of
mathematical software for linear algebra computations on distributed memory computers …

PUMMA: Parallel universal matrix multiplication algorithms on distributed memory concurrent computers

J Choi, DW Walker, JJ Dongarra - Concurrency: Practice and …, 1994 - Wiley Online Library
Abstract The paper describes Parallel Universal Matrix Multiplication Algorithms (PUMMA)
on distributed memory concurrent computers. The PUMMA package includes not only the …

[KNIHA][B] A new O (N (2)) algorithm for the symmetric tridiagonal eigenvalue/eigenvector problem

IS Dhillon - 1997 - search.proquest.com
Computing the eigenvalues and orthogonal eigenvectors of an $ n\times n $ symmetric
tridiagonal matrix is an important task that arises while solving any symmetric eigenproblem …

Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems

F Song, A YarKhan, J Dongarra - Proceedings of the conference on high …, 2009 - dl.acm.org
This paper presents a dynamic task scheduling approach to executing dense linear algebra
algorithms on multicore systems (either shared-memory or distributed-memory). We use a …

Software libraries for linear algebra computations on high performance computers

JJ Dongarra, DW Walker - SIAM review, 1995 - SIAM
This paper discusses the design of linear algebra libraries for high performance computers.
Particular emphasis is placed on the development of scalable algorithms for multiple …

FORTRAN MA Language for Modular Parallel Programming

IT Foster, KM Chandy - Journal of Parallel and Distributed Computing, 1995 - Elsevier
FORTRAN M is a small set of extensions to FORTRAN 77 that supports a modular approach
to the design of message-passing programs. It has the following features.(1) Modularity …