Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects E Agullo, J Demmel, J Dongarra, B Hadri, J Kurzak, J Langou, H Ltaief, ... Journal of Physics: Conference Series 180 (1), 012037, 2009 | 592 | 2009 |
A hybridization methodology for high-performance linear algebra software for GPUs E Agullo, C Augonnet, J Dongarra, H Ltaief, R Namyst, S Thibault, ... GPU Computing Gems Jade Edition, 473-484, 2012 | 166 | 2012 |
QR factorization on a multicore node enhanced with multiple GPU accelerators E Agullo, C Augonnet, J Dongarra, M Faverge, H Ltaief, S Thibault, ... 2011 IEEE International Parallel & Distributed Processing Symposium, 932-943, 2011 | 146 | 2011 |
Achieving high performance on supercomputers with a sequential task-based programming model E Agullo, O Aumage, M Faverge, N Furmento, F Pruvost, M Sergent, ... IEEE Transactions on Parallel and Distributed Systems, 2017 | 126 | 2017 |
Comparative study of one-sided factorizations with multiple software packages on multi-core hardware E Agullo, B Hadri, H Ltaief, J Dongarrra Proceedings of the Conference on High Performance Computing Networking …, 2009 | 103 | 2009 |
Task-based FMM for multicore architectures E Agullo, B Bramas, O Coulaud, E Darve, M Messner, T Takahashi SIAM Journal on Scientific Computing 36 (1), C66-C93, 2014 | 96 | 2014 |
LU factorization for accelerator-based systems E Agullo, C Augonnet, J Dongarra, M Faverge, J Langou, H Ltaief, ... 2011 9th IEEE/ACS International Conference on Computer Systems and …, 2011 | 91 | 2011 |
Plasma users guide E Agullo, J Dongarra, B Hadri, J Kurzak, J Langou, J Langou, H Ltaief, ... Technical report, ICL, UTK, 2009 | 69 | 2009 |
Implementing multifrontal sparse solvers for multicore architectures with sequential task flow runtime systems E Agullo, A Buttari, A Guermouche, F Lopez Acm transactions on mathematical software (toms) 43 (2), 1-22, 2016 | 68 | 2016 |
Task‐based FMM for heterogeneous architectures E Agullo, B Bramas, O Coulaud, E Darve, M Messner, T Takahashi Concurrency and Computation: Practice and Experience 28 (9), 2608-2629, 2016 | 65 | 2016 |
Are static schedules so bad? a case study on cholesky factorization E Agullo, O Beaumont, L Eyraud-Dubois, S Kumar 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2016 | 64 | 2016 |
Multifrontal QR factorization for multicore architectures over runtime systems E Agullo, A Buttari, A Guermouche, F Lopez Euro-Par 2013 Parallel Processing: 19th International Conference, Aachen …, 2013 | 57 | 2013 |
Block GMRES method with inexact breakdowns and deflated restarting E Agullo, L Giraud, YF Jing SIAM Journal on Matrix Analysis and Applications 35 (4), 1625-1651, 2014 | 54 | 2014 |
QR factorization of tall and skinny matrices in a grid computing environment E Agullo, C Coti, J Dongarra, T Herault, J Langem 2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010 | 51 | 2010 |
Tile QR factorization with parallel panel processing for multicore architectures B Hadri, H Ltaief, E Agullo, J Dongarra 2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010 | 48 | 2010 |
Robust memory-aware mappings for parallel multifrontal factorizations E Agullo, PR Amestoy, A Buttari, A Guermouche, JY L'Excellent, FH Rouet SIAM Journal on Scientific Computing 38 (3), C256-C279, 2016 | 47 | 2016 |
Analyzing the effect of local rounding error propagation on the maximal attainable accuracy of the pipelined conjugate gradient method S Cools, EF Yetkin, E Agullo, L Giraud, W Vanroose SIAM Journal on Matrix Analysis and Applications 39 (1), 426-450, 2018 | 45 | 2018 |
Towards resilient parallel linear Krylov solvers: recover-restart strategies E Agullo, L Giraud, A Guermouche, J Roman, M Zounon INRIA, 2013 | 42 | 2013 |
Parallel hierarchical hybrid linear solvers for emerging computing platforms E Agullo, L Giraud, A Guermouche, J Roman Comptes rendus. Mécanique 339 (2-3), 96-103, 2011 | 40 | 2011 |
Bridging the gap between OpenMP and task-based runtime systems for the fast multipole method E Agullo, O Aumage, B Bramas, O Coulaud, S Pitoiset IEEE Transactions on Parallel and Distributed Systems 28 (10), 2794-2807, 2017 | 39 | 2017 |