Open access

Articles with open access mandates - Kawthar Shafie Khorassani

Not publicly available: 1

High-performance adaptive MPI derived datatype communication for modern Multi-GPU systems

CH Chu, JM Hashmi, KS Khorassani, H Subramoni, DK Panda

2019 IEEE 26th International Conference on High Performance Computing, Data …, 2019

Mandates: US National Science Foundation

Publicly available: 13

[PDF] nsf.gov

NV-Group: link-efficient reduction for distributed deep learning on modern dense GPU systems

CH Chu, P Kousha, AA Awan, KS Khorassani, H Subramoni, DK Panda

Proceedings of the 34th ACM International Conference on Supercomputing, 1-12, 2020

Mandates: US National Science Foundation

[PDF] nsf.gov

Designing a ROCm-aware MPI library for AMD GPUs: early experiences

K Shafie Khorassani, J Hashmi, CH Chu, CC Chen, H Subramoni, ...

International Conference on High Performance Computing, 118-136, 2021

Mandates: US National Science Foundation

[PDF] nsf.gov

Accelerating MPI all-to-all communication with online compression on modern GPU clusters

Q Zhou, P Kousha, Q Anthony, K Shafie Khorassani, A Shafi, ...

International Conference on High Performance Computing, 3-25, 2022

Mandates: US National Science Foundation

[PDF] nsf.gov

Adaptive and hierarchical large message all-to-all communication algorithms for large-scale dense GPU systems

KS Khorassani, CH Chu, QG Anthony, H Subramoni, DK Panda

2021 IEEE/ACM 21st International Symposium on Cluster, Cloud and Internet …, 2021

Mandates: US National Science Foundation

[PDF] nsf.gov

Dynamic kernel fusion for bulk non-contiguous data transfer on GPU clusters

CH Chu, KS Khorassani, Q Zhou, H Subramoni, DK Panda

2020 IEEE International Conference on Cluster Computing (CLUSTER), 130-141, 2020

Mandates: US National Science Foundation

[PDF] nsf.gov

High performance MPI over the Slingshot interconnect: Early experiences

K Shafie Khorassani, CC Chen, B Ramesh, A Shafi, H Subramoni, ...

Practice and Experience in Advanced Research Computing, 1-7, 2022

Mandates: US National Science Foundation, US Department of Energy

[PDF] nsf.gov

Highly efficient alltoall and alltoallv communication algorithms for GPU systems

CC Chen, KS Khorassani, QG Anthony, A Shafi, H Subramoni, DK Panda

2022 IEEE International Parallel and Distributed Processing Symposium …, 2022

Mandates: US National Science Foundation

[PDF] nsf.gov

Implementing and Optimizing a GPU-aware MPI Library for Intel GPUs: Early Experiences

CC Chen, KS Khorassani, GKR Kuncham, R Vaidya, M Abduljabbar, ...

2023 IEEE/ACM 23rd International Symposium on Cluster, Cloud and Internet …, 2023

Mandates: US National Science Foundation

[PDF] nsf.gov

Network assisted non-contiguous transfers for GPU-aware MPI libraries

KK Suresh, KS Khorassani, CC Chen, B Ramesh, M Abduljabbar, A Shafi, ...

2022 IEEE Symposium on High-Performance Interconnects (HOTI), 13-20, 2022

Mandates: US National Science Foundation

[PDF] supercomputing.org

OMB-UM: Design, implementation, and evaluation of CUDA unified memory aware MPI benchmarks

KV Manian, CH Chu, AA Awan, KS Khorassani, H Subramoni, DK Panda

2019 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High …, 2019

Mandates: US National Science Foundation

[PDF] nsf.gov

Designing and Optimizing GPU-aware Nonblocking MPI Neighborhood Collective Communication for PETSc

KS Khorassani, CC Chen, H Subramoni, DK Panda

2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2023

Mandates: US National Science Foundation

[PDF] nsf.gov

MPI-xCCL: A Portable MPI Library over Collective Communication Libraries for Various Accelerators

CC Chen, K Shafie Khorassani, P Kousha, Q Zhou, J Yao, H Subramoni, ...

Proceedings of the SC'23 Workshops of The International Conference on High …, 2023

Mandates: US National Science Foundation

[PDF] ict.ac.cn

High Performance MPI over the Slingshot Interconnect

KS Khorassani, CC Chen, B Ramesh, A Shafi, H Subramoni, DK Panda

Journal of Computer Science and Technology 38 (1), 128-145, 2023

Mandates: US National Science Foundation, US Department of Energy

Publication and funding information is determined automatically by a computer program.