Solving large problem sizes of index-digit algorithms on GPU: FFT and tridiagonal system solvers | AP Diéguez, M Amor, J Lobeiras, R Doallo | IEEE Transactions on Computers 67 (1), 86-101, 2017 | 23 | 2017 |
New tridiagonal systems solvers on GPU architectures | AP Dieguez, M Amor, R Doallo | 2015 IEEE 22nd International Conference on High Performance Computing (HiPC …, 2015 | 13 | 2015 |
Efficient scan operator methods on a GPU | AP Diéguez, M Amor, R Doallo | 2014 IEEE 26th International Symposium on Computer Architecture and High …, 2014 | 12 | 2014 |
TRAVOLTA: GPU acceleration and algorithmic improvements for constructing quantum optimal control fields in photo-excited systems | JM Rodríguez-Borbón, X Wang, AP Diéguez, KZ Ibrahim, BM Wong | Computer Physics Communications 296, 109017, 2024 | 8 | 2024 |
Efficient high-precision integer multiplication on the GPU | AP Dieguez, M Amor, R Doallo, A Nukada, S Matsuoka | The International Journal of High Performance Computing Applications 36 (3 …, 2022 | 7 | 2022 |
Tree partitioning reduction: A new parallel partition method for solving tridiagonal systems | AP Diéguez, M Amor, R Doallo | ACM Transactions on Mathematical Software (TOMS) 45 (3), 1-26, 2019 | 7 | 2019 |
Parallel prefix operations on GPU: tridiagonal system solvers and scan operators | AP Diéguez, M Amor, R Doallo | The Journal of Supercomputing 75, 1510-1523, 2019 | 5 | 2019 |
Solving multiple tridiagonal systems on a multi-GPU platform | AP Dieguez, MA Lopez, RD Biempica | 2018 26th Euromicro International Conference on Parallel, Distributed and …, 2018 | 5 | 2018 |
QRCODE: Massively parallelized real-time time-dependent density functional theory for periodic systems | M Choi, MS Okyay, AP Dieguez, M Del Ben, KZ Ibrahim, BM Wong | Computer Physics Communications 305, 109349, 2024 | 4 | 2024 |
ML-based performance portability for time-dependent density functional theory in HPC environments | AP Dieguez, M Choi, X Zhu, BM Wong, KZ Ibrahim | 2022 IEEE/ACM International Workshop on Performance Modeling, Benchmarking …, 2022 | 4 | 2022 |
BPLG–BMCS: GPU-sorting algorithm using a tuning skeleton library | AP Diéguez, M Amor, R Doallo | The Journal of Supercomputing 73 (1), 4-16, 2017 | 4 | 2017 |
VAN-DAMME: GPU-accelerated and symmetry-assisted quantum optimal control of multi-qubit systems | JM Rodríguez-Borbón, X Wang, AP Diéguez, KZ Ibrahim, BM Wong | Computer Physics Communications 307, 109403, 2025 | 1 | 2025 |
Parallel prefix operations on heterogeneous platforms | AP Diéguez | Universidade da Coruña, 2019 | 1 | 2019 |
Efficient Solving of Scan Primitive on Multi-GPU Systems | AP Diéguez, M Amor, R Doallo, A Nukada, S Matsuoka | 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2018 | 1 | 2018 |
Unconventional nonlinear Hall effects in twisted multilayer 2D materials | MS Okyay, M Choi, Q Xu, AP Diéguez, M Del Ben, KZ Ibrahim, BM Wong | npj 2D Materials and Applications 9 (1), 1, 2025 | | 2025 |
Cost-Effective Methodology for Complex Tuning Searches in HPC: Navigating Interdependencies and Dimensionality | AP Dieguez, M Choi, M Okyay, M Del Ben, BM Wong, KZ Ibrahim | 2024 IEEE International Parallel and Distributed Processing Symposium …, 2024 | | 2024 |
Performance Tuning for GPU-Embedded Systems: Machine-Learning-Based and Analytical Model-Driven Tuning Methodologies | AP Diéguez, MA López | 2023 IEEE 35th International Symposium on Computer Architecture and High …, 2023 | | 2023 |
Tree Partitioning Reduction: A New Parallel Partition Method for Solving Tridiagonal Systems | A Pérez Diéguez, M Amor, R Doallo | ACM Transactions on Mathematical Software 45 (3), 593-617, 2019 | | 2019 |
Parallel prefix operations on heterogeneous platforms | A Pérez Diéguez | | | 2018 |
Techniques for Autotuning Algorithms on Heterogenous Platforms | AP Diéguez, M Amor, R Doallo | Computing Systems (NESUS PhD 2016) Timisoara, Romania, 25, 2016 | | 2016 |