ScaLAPACK user’s guide J Dongarra, L Blackford, J Choi, A Cleary, E D’Azevedo, J Demmel, ... Society for Industrial and Applied Mathematics, Philadelphia, PA 28, 1997 | 2495* | 1997 |
Automated empirical optimizations of software and the ATLAS project RC Whaley, A Petitet, JJ Dongarra Parallel computing 27 (1-2), 3-35, 2001 | 2013 | 2001 |
Automatically tuned linear algebra software RC Whaley, JJ Dongarra SC'98: Proceedings of the 1998 ACM/IEEE conference on Supercomputing, 38-38, 1998 | 1539 | 1998 |
An updated set of basic linear algebra subprograms (BLAS) LS Blackford, A Petitet, R Pozo, K Remington, RC Whaley, J Demmel, ... ACM Transactions on Mathematical Software 28 (2), 135-151, 2002 | 1162 | 2002 |
LAPACK working note 95 ScaLAPACK: A portable linear algebra library for distributed memory computers-design issues and performance J Choi, J Demmel, I Dhillon, J Dongarra, S Ostrouchov, A Petitet, ... University of Tennessee, 1995 | 679* | 1995 |
Minimizing development and maintenance costs in supporting persistently optimized BLAS RC Whaley, A Petitet Software: Practice and Experience 35 (2), 101-121, 2005 | 378 | 2005 |
Encyclopedia of parallel computing D Padua Springer Science & Business Media, 2011 | 372 | 2011 |
Design and implementation of the ScaLAPACK LU, QR, and Cholesky factorization routines J Choi, JJ Dongarra, LS Ostrouchov, AP Petitet, DW Walker, RC Whaley Scientific Programming 5 (3), 173-184, 1996 | 342 | 1996 |
Self-adapting linear algebra algorithms and software J Demmel, J Dongarra, V Eijkhout, E Fuentes, A Petitet, R Vuduc, ... Proceedings of the IEEE 93 (2), 293-312, 2005 | 277 | 2005 |
A proposal for a set of parallel basic linear algebra subprograms J Choi, J Dongarra, S Ostrouchov, A Petitet, D Walker, RC Whaley Applied Parallel Computing Computations in Physics, Chemistry and …, 1996 | 274 | 1996 |
Two dimensional basic linear algebra communication subprograms JJ Dongarra, RC Whaley, RA van de Geijn Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA …, 1993 | 123 | 1993 |
ScaLAPACK: a linear algebra library for message-passing computers LS Blackford, J Choi, AJ Cleary, EF D'Azevedo, J Demmel, IS Dhillon, ... Proceedings of the Eighth SIAM Conference on Parallel Processing for …, 1997 | 110 | 1997 |
Timing high performance kernels through empirical compilation RC Whaley, DB Whalley 2005 International Conference on Parallel Processing (ICPP'05), 89-98, 2005 | 54 | 2005 |
Scaling LAPACK panel operations using parallel cache assignment AM Castaldo, RC Whaley ACM Sigplan Notices 45 (5), 223-232, 2010 | 53 | 2010 |
ScaLAPACK Users’ Guide. SIAM, Philadelphia, PA, 1997 LS Blackford, J Choi, A Cleary, E d’Azevedo, J Demmel, I Dhillon, ... | 51 | |
Achieving accurate and context‐sensitive timing for code optimization RC Whaley, AM Castaldo Software: Practice and Experience 38 (15), 1621-1642, 2008 | 50 | 2008 |
A User’s Guide to the BLACS J Dongarra, R van de Geijn, RC Whaley Technical Report CS-93-187, University of Tennessee, 1993. LAPACK Working …, 1993 | 50 | 1993 |
LAPACK Working Note 94 A User's Guide to the BLACS v1. JJ Dongarra, RC Whaley Tech.£ eport 13, 1997 | 46 | 1997 |
Atlas (automatically tuned linear algebra software) RC Whaley http://www. netlib. org/atlas/index. html, 2011 | 45 | 2011 |
Basic linear algebra communication subprograms: Analysis and implementation across multiple parallel architectures RC Whaley University of Tennessee, Knoxville, 1994 | 43 | 1994 |