Towards highly efficient DGEMM on the emerging SW26010 many-core processor L Jiang, C Yang, Y Ao, W Yin, W Ma, Q Sun, F Liu, R Lin, P Zhang 2017 46th International Conference on Parallel Processing (ICPP), 422-431, 2017 | 49 | 2017 |
Performance optimization of the HPCG benchmark on the Sunway TaihuLight supercomputer Y Ao, C Yang, F Liu, W Yin, L Jiang, Q Sun ACM Transactions on Architecture and Code Optimization (TACO) 15 (1), 1-20, 2018 | 37 | 2018 |
Enabling highly efficient batched matrix multiplications on SW26010 many-core processor L Jiang, C Yang, W Ma ACM Transactions on Architecture and Code Optimization (TACO) 17 (1), 1-23, 2020 | 15 | 2020 |
xMath2. 0: a high-performance extended math library for SW26010-Pro many-core processor F Liu, W Ma, Y Zhao, D Chen, Y Hu, Q Lu, WW Yin, X Yuan, L Jiang, ... CCF Transactions on High Performance Computing 5 (1), 56-71, 2023 | 10 | 2023 |
Efficient Training of Large Language Models on Distributed Infrastructures: A Survey J Duan, S Zhang, Z Wang, L Jiang, W Qu, Q Hu, G Wang, Q Weng, H Yan, ... arXiv preprint arXiv:2407.20018, 2024 | 7 | 2024 |
EasyView: Enabling and Scheduling Tensor Views in Deep Learning Compilers L Jiang, P Xu, Q Zhu, X Li, S Yan, X Zhang, D Lin, W Ma, Z Li, J Liu, J Ma, ... Proceedings of the 51st International Conference on Parallel Processing, 1-11, 2022 | 2 | 2022 |
A Holistic Functionalization Approach to Optimizing Imperative Tensor Programs in Deep Learning J Ma, X Li, Z Wang, X Zhang, S Yan, Y Chen, Y Zhang, M Jin, L Jiang, ... Proceedings of the 61st ACM/IEEE Design Automation Conference, 1-6, 2024 | | 2024 |
ZeroPP: Unleashing Exceptional Parallelism Efficiencythrough Tensor-Parallelism-Free Methodology D Tang, L Jiang, J Zhou, M Jin, H Li, X Zhang, Z Pei, J Zhai arXiv preprint arXiv:2402.03791, 2024 | | 2024 |
LongTail-Bench: A Benchmark Suite for Domain-Specific Operators in Deep Learning X Li, S Yan, L Jiang, P Xu, J Ma, X Zhang, D Lin 2022 IEEE International Symposium on Workload Characterization (IISWC), 282-295, 2022 | | 2022 |