Bridging the gap between deep learning and frustrated quantum spin system for extreme-scale simulations on new generation of Sunway supercomputer M Li, J Chen, Q Xiao, F Wang, Q Jiang, X Zhao, R Lin, H An, X Liang, L He IEEE Transactions on Parallel and Distributed Systems 33 (11), 2846-2859, 2022 | 23 | 2022 |
High performance computing of DGDFT for tens of thousands of atoms using millions of cores on Sunway TaihuLight W Hu, X Qin, Q Jiang, J Chen, H An, W Jia, F Li, X Liu, D Chen, F Liu, ... Science Bulletin 66 (2), 111-119, 2021 | 22 | 2021 |
RDMA-based Apache Storm for high-performance stream data processing Z Zhang, Z Liu, Q Jiang, J Chen, H An International Journal of Parallel Programming 49 (5), 671-684, 2021 | 19 | 2021 |
2.5 million-atom ab initio electronic-structure simulation of complex metallic heterostructures with DGDFT W Hu, H An, Z Guo, Q Jiang, X Qin, J Chen, W Jia, C Yang, Z Luo, J Li, ... SC22: International Conference for High Performance Computing, Networking …, 2022 | 15 | 2022 |
Uncovering the performance bottleneck of modern HPC processor with static code analyzer: a case study on Kunpeng 920 S Tan, Q Jiang, Z Cao, X Hao, J Chen, H An CCF Transactions on High Performance Computing 6 (3), 343-364, 2024 | 5 | 2024 |
Accelerating parallel first-principles excited-state calculation by low-rank approximation with K-Means clustering Q Jiang, J Li, J Chen, X Qin, L Wan, J Yang, J Liu, W Hu, H An Proceedings of the 51st International Conference on Parallel Processing, 1-11, 2022 | 4 | 2022 |
High performance computing for first-principles Kohn-Sham density functional theory towards exascale supercomputers X Qin, J Chen, Z Luo, L Wan, J Li, S Jiao, Z Zhang, Q Jiang, W Hu, H An, ... CCF Transactions on High Performance Computing 5 (1), 26-42, 2023 | 3 | 2023 |
Quantifying throughput of basic blocks on Arm microarchitectures by static code analyzers: A case study on Kunpeng 920 Q Jiang, S Tan, Z Cao, X Hao, J Chen, H An 2022 IEEE 24th Int Conf on High Performance Computing & Communications; 8th …, 2022 | 2 | 2022 |
An efficient multi-GPU implementation for linear-response time-dependent density functional theory Q Jiang, L Wan, S Jiao, W Hu, J Chen, H An 2020 IEEE 22nd International Conference on High Performance Computing and …, 2020 | 2 | 2020 |
GDARTS: A GPU-based runtime system for dataflow task programming on dependency applications M Li, Q Jiang, H Lin, H An 2019 IEEE Intl Conf on Parallel & Distributed Processing with Applications …, 2019 | 2 | 2019 |
Extending the limit of LR-TDDFT on two different approaches: Numerical algorithms and new Sunway heterogeneous supercomputer Q Jiang, Z Cao, X Cui, L Wan, X Qin, H Cao, H An, J Chen, J Liu, W Hu, ... Parallel Computing 120, 103085, 2024 | 1 | 2024 |
A3PIM: An Automated, Analytic and Accurate Processing-in-Memory Offloader Q Jiang, S Tan, J Chen, H An 2024 Design, Automation & Test in Europe Conference & Exhibition (DATE), 1-6, 2024 | 1 | 2024 |
Enabling 13K-Atom Excited-State GW Calculations via Low-Rank Approximations and HPC on the New Sunway Supercomputer W Wu, Z Zhou, Q Jiang, J Feng, X Qin, H Ma, Z Cao, J Chen, S Chen, ... SC24: International Conference for High Performance Computing, Networking …, 2024 | | 2024 |
PWDFT-SW: Extending the Limit of Plane-Wave DFT Calculations to 16K Atoms on the New Sunway Supercomputer Q Jiang, Z Cao, J Chen, X Qin, W Hu, H An, J Yang arXiv preprint arXiv:2406.10765, 2024 | | 2024 |