Micro-Benchmarking MPI Partitioned Point-to-Point Communication Y Hassan Temucin, RE Grant, A Afsahi Proceedings of the 51st International Conference on Parallel Processing, 1-12, 2022 | 14 | 2022 |
Efficient Multi-Path NVLink/PCIe-Aware UCX based Collective Communication for Deep Learning YH Temuçin, AH Sojoodi, P Alizadeh, A Afsahi | 11 | 2021 |
Efficient Process Arrival Pattern Aware Collective Communication for Deep Learning P Alizadeh, A Sojoodi, Y Hassan Temucin, A Afsahi Proceedings of the 29th European MPI Users' Group Meeting, 68-78, 2022 | 9 | 2022 |
Accelerating Deep Learning Using Interconnect-Aware UCX Communication for MPI Collectives YH Temuçin, AH Sojoodi, P Alizadeh, B Kitor, A Afsahi IEEE Micro 42 (2), 68-76, 2022 | 9 | 2022 |
A Dynamic Network-Native MPI Partitioned Aggregation Over InfiniBand Verbs YH Temuçin, S Levy, W Schonbein, RE Grant, A Afsahi 2023 IEEE International Conference on Cluster Computing (CLUSTER), 259-270, 2023 | 7 | 2023 |
ROCm-Aware Leader-based Designs for MPI Neighbourhood Collectives YH Temuçin, M Gazimirsaeed, RE Grant, A Afsahi ISC High Performance 2024 Research Paper Proceedings (39th International …, 2024 | 2 | 2024 |
Design and Implementation of MPI-Native GPU-Initiated MPI Partitioned Communication YH Temuçin, W Schonbein, S Levy, A Sojoodi, RE Grant, A Afsahi IEEE/ACM International Workshop on Exascale MPI (ExaMPI), held in conjunction …, 0 | 2* | |
Enhancing Intra-Node GPU-to-GPU Performance in MPI+UCX through Multi-Path Communication A Sojoodi, YH Temucin, A Afsahi Proceedings of the 3rd International Workshop on Extreme Heterogeneity …, 2024 | 1 | 2024 |
High-Performance Network- and GPU-Aware Communication for MPI Partitioned and MPI Neighbourhoods YH Temuçin Queen’s University, 2024 | | 2024 |
High-Performance Interconnect-Aware MPI communication for Deep Learning Workloads YH Temucin Queen's University (Canada), 2021 | | 2021 |