The analysis of a plane wave pseudopotential density functional theory code on a GPU machine W Jia, Z Cao, L Wang, J Fu, X Chi, W Gao, LW Wang Computer Physics Communications 184 (1), 9-18, 2013 | 416 | 2013 |
Fast plane wave density functional theory molecular dynamics calculations on multi-GPU machines W Jia, J Fu, Z Cao, L Wang, X Chi, W Gao, LW Wang Journal of Computational Physics 251, 102-115, 2013 | 373 | 2013 |
DAPPLE: A pipelined data parallel approach for training large models S Fan, Y Rong, C Meng, Z Cao, S Wang, Z Zheng, C Wu, G Long, J Yang, ... Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of …, 2021 | 238 | 2021 |
Brief view on requirements and development of high performance computing application. Y Zhao, P Zhu, X Chi, T Niu, Z Cao Jisuanji Yanjiu yu Fazhan(Computer Research and Development) 44 (10), 1640-1646, 2007 | 30* | 2007 |
Parallel simulation of high‐dimensional American option pricing based on CPU versus MIC Y Hu, Q Li, Z Cao, J Wang Concurrency and Computation: Practice and Experience 27 (5), 1110-1121, 2015 | 22 | 2015 |
PHoToNs–A parallel heterogeneous and threads oriented code for cosmological N-body simulation Q Wang, ZY Cao, L Gao, XB Chi, C Meng, J Wang, L Wang Research in Astronomy and Astrophysics 18 (6), 062, 2018 | 12 | 2018 |
Hap: Spmd dnn training on heterogeneous gpu clusters with automated program synthesis S Zhang, L Diao, C Wu, Z Cao, S Wang, W Lin Proceedings of the Nineteenth European Conference on Computer Systems, 524-541, 2024 | 11 | 2024 |
USGPA: a user-centric and secure grid portal architecture for high-performance computing R Cao, X Chi, Z Cao, Z Dai, H Xiao 2009 IEEE International Symposium on Parallel and Distributed Processing …, 2009 | 9 | 2009 |
Large-Scale Parallelization Based on CPU and GPU Cluster for Cosmological Fluid Simulations C Meng, L Wang, Z Cao, L Feng, W Zhu Computers & Fluids, 2014 | 8 | 2014 |
Acceleration of a high order finite-difference WENO scheme for large-scale cosmological simulations on GPU C Meng, L Wang, Z Cao, X Ye, LL Feng 2013 IEEE International Symposium on Parallel & Distributed Processing …, 2013 | 8 | 2013 |
RMI-based grid re-development model for high-performance computing RQ Cao, ZY Cao, XB Chi, HL Xiao Journal of Computer Applications 9, 069, 2010 | 6* | 2010 |
Auto-parallelizing large models with rhino: A systematic approach on production ai platform S Zhang, L Diao, S Wang, Z Cao, Y Gu, C Si, Z Shi, Z Zheng, C Wu, W Lin arXiv preprint arXiv:2302.08141, 2023 | 5 | 2023 |
Runtime Environment Configuration and Optimization on High-performance Computing Clusters C Zongyan e-Science Technology & Application 2 (5), 52-61, 2011 | 4* | 2011 |
Resilience framework of three-layer supercomputing environment Z DAI, H XIAO, R CAO, X CHI, Z CAO Application Research of Computers 7, 050, 2011 | 4* | 2011 |
Design and implementation of computation quota system for supercomputing environment T Niu, P Zhu, Y Zhao, ZY Cao Jisuanji Yingyong/ Journal of Computer Applications 30, 2010 | 4* | 2010 |
Parallelization and optimization of huge scale sequence alignment computation Z Cao, X Lang, X Liu, X Chi Journal of Computer Applications 31 (S2), 2011 | 3* | 2011 |
The Comparison of Different Level Faulttolerant MPI Parallel Program Y Zhao, Z Cao, P Zhu, X Chi e-Science Technology & Application 2 (6), 14-21, 2011 | 3* | 2011 |
Priority scheduling of cluster jobs based on user evaluation Z Cao, Y Zhao, T Niu, P Zhu, X Chi Huazhong Keji Daxue Xuebao(Ziran Kexue Ban)/ Journal of Huazhong University …, 2011 | 3* | 2011 |
Ada-Grouper: Accelerating Pipeline Parallelism in Preempted Network by Adaptive Group-Scheduling for Micro-Batches S Wang, Z Cao, C Si, L Diao, J Wang, W Lin arXiv preprint arXiv:2303.01675, 2023 | 2 | 2023 |
First Principle Calculation on Multi-GPU Machines W Jia, Z Cao, J Fu, L Wang e-Science Technology & Application 3 (5), 69-75, 2012 | 1* | 2012 |