Articles with public access mandates - Kawthar Shafie Khorassani
Not available anywhere: 1
High-performance adaptive MPI derived datatype communication for modern Multi-GPU systems
CH Chu, JM Hashmi, KS Khorassani, H Subramoni, DK Panda
2019 IEEE 26th International Conference on High Performance Computing, Data …, 2019
Mandates: US National Science Foundation
Available somewhere: 13
Nv-group: link-efficient reduction for distributed deep learning on modern dense gpu systems
CH Chu, P Kousha, AA Awan, KS Khorassani, H Subramoni, DK Panda
Proceedings of the 34th ACM International Conference on Supercomputing, 1-12, 2020
Mandates: US National Science Foundation
Designing a ROCm-aware MPI library for AMD GPUs: early experiences
K Shafie Khorassani, J Hashmi, CH Chu, CC Chen, H Subramoni, ...
International Conference on High Performance Computing, 118-136, 2021
Mandates: US National Science Foundation
Accelerating mpi all-to-all communication with online compression on modern gpu clusters
Q Zhou, P Kousha, Q Anthony, K Shafie Khorassani, A Shafi, ...
International Conference on High Performance Computing, 3-25, 2022
Mandates: US National Science Foundation
Adaptive and hierarchical large message all-to-all communication algorithms for large-scale dense gpu systems
KS Khorassani, CH Chu, QG Anthony, H Subramoni, DK Panda
2021 IEEE/ACM 21st International Symposium on Cluster, Cloud and Internet …, 2021
Mandates: US National Science Foundation
Dynamic kernel fusion for bulk non-contiguous data transfer on GPU clusters
CH Chu, KS Khorassani, Q Zhou, H Subramoni, DK Panda
2020 IEEE International Conference on Cluster Computing (CLUSTER), 130-141, 2020
Mandates: US National Science Foundation
High performance mpi over the slingshot interconnect: Early experiences
K Shafie Khorassani, CC Chen, B Ramesh, A Shafi, H Subramoni, ...
Practice and Experience in Advanced Research Computing, 1-7, 2022
Mandates: US National Science Foundation, US Department of Energy
Highly efficient alltoall and alltoallv communication algorithms for gpu systems
CC Chen, KS Khorassani, QG Anthony, A Shafi, H Subramoni, DK Panda
2022 IEEE International Parallel and Distributed Processing Symposium …, 2022
Mandates: US National Science Foundation
Implementing and Optimizing a GPU-aware MPI Library for Intel GPUs: Early Experiences
CC Chen, KS Khorassani, GKR Kuncham, R Vaidya, M Abduljabbar, ...
2023 IEEE/ACM 23rd International Symposium on Cluster, Cloud and Internet …, 2023
Mandates: US National Science Foundation
Network assisted non-contiguous transfers for GPU-aware MPI libraries
KK Suresh, KS Khorassani, CC Chen, B Ramesh, M Abduljabbar, A Shafi, ...
2022 IEEE Symposium on High-Performance Interconnects (HOTI), 13-20, 2022
Mandates: US National Science Foundation
OMB-UM: Design, implementation, and evaluation of CUDA unified memory aware MPI benchmarks
KV Manian, CH Chu, AA Awan, KS Khorassani, H Subramoni, DK Panda
2019 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High …, 2019
Mandates: US National Science Foundation
Designing and Optimizing GPU-aware Nonblocking MPI Neighborhood Collective Communication for PETSc*
KS Khorassani, CC Chen, H Subramoni, DK Panda
2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2023
Mandates: US National Science Foundation
MPI-xCCL: A Portable MPI Library over Collective Communication Libraries for Various Accelerators
CC Chen, K Shafie Khorassani, P Kousha, Q Zhou, J Yao, H Subramoni, ...
Proceedings of the SC'23 Workshops of The International Conference on High …, 2023
Mandates: US National Science Foundation
High Performance MPI over the Slingshot Interconnect
KS Khorassani, CC Chen, B Ramesh, A Shafi, H Subramoni, DK Panda
Journal of Computer Science and Technology 38 (1), 128-145, 2023
Mandates: US National Science Foundation, US Department of Energy
Publication and funding information is determined automatically by a computer program.