Algebraic methods for interactive proof systems

C Lund, L Fortnow, H Karloff, N Nisan - Journal of the ACM (JACM), 1992 - dl.acm.org
A new algebraic technique for the construction of interactive proof systems is presented. Our
technique is used to prove that every language in the polynomial-time hierarchy has an …

Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects

E Agullo, J Demmel, J Dongarra, B Hadri… - Journal of Physics …, 2009 - iopscience.iop.org
The emergence and continuing use of multi-core architectures and graphics processing
units require changes in the existing software and sometimes even a redesign of the …

Dense linear algebra solvers for multicore with GPU accelerators

S Tomov, R Nath, H Ltaief… - 2010 IEEE International …, 2010 - ieeexplore.ieee.org
Solving dense linear systems of equations is a fundamental problem in scientific computing.
Numerical simulations involving complex systems represented in terms of unknown …

The singular value decomposition: Anatomy of optimizing an algorithm for extreme scale

J Dongarra, M Gates, A Haidar, J Kurzak, P Luszczek… - SIAM review, 2018 - SIAM
The computation of the singular value decomposition, or SVD, has a long history with many
improvements over the years, both in its implementations and algorithmically. Here, we …

Tiled QR factorization algorithms

H Bouwmeester, M Jacquelin, J Langou… - Proceedings of 2011 …, 2011 - dl.acm.org
This work revisits existing algorithms for the QR factorization of rectangular matrices
composed of p× q tiles, where p≥ q. Within this framework, we study the critical paths and …

A survey of recent developments in parallel implementations of Gaussian elimination

S Donfack, J Dongarra, M Faverge… - Concurrency and …, 2015 - Wiley Online Library
Gaussian elimination is a canonical linear algebra procedure for solving linear systems of
equations. In the last few years, the algorithm has received a lot of attention in an attempt to …

QR factorization on a multicore node enhanced with multiple GPU accelerators

E Agullo, C Augonnet, J Dongarra… - … Parallel & Distributed …, 2011 - ieeexplore.ieee.org
One of the major trends in the design of exascale architectures is the use of multicore nodes
enhanced with GPU accelerators. Exploiting all resources of a hybrid accelerators-based …

Enabling and scaling matrix computations on heterogeneous multi-core and multi-GPU systems

F Song, S Tomov, J Dongarra - … of the 26th ACM international conference …, 2012 - dl.acm.org
We present a new approach to utilizing all CPU cores and all GPUs on heterogeneous
multicore and multi-GPU systems to support dense matrix computations efficiently. The main …

Parallel reduction to condensed forms for symmetric eigenvalue problems using aggregated fine-grained and memory-aware kernels

A Haidar, H Ltaief, J Dongarra - Proceedings of 2011 International …, 2011 - dl.acm.org
This paper introduces a novel implementation in reducing a symmetric dense matrix to
tridiagonal form, which is the preprocessing step toward solving symmetric eigenvalue …

Solving acoustic boundary integral equations using high performance tile low-rank LU factorization

N Al-Harthi, R Alomairy, K Akbudak, R Chen… - … Conference, ISC High …, 2020 - Springer
We design and develop a new high performance implementation of a fast direct LU-based
solver using low-rank approximations on massively parallel systems. The LU factorization is …