关注
Lijuan Jiang
Lijuan Jiang
Shanghai AI Lab
在 pjlab.org.cn 的电子邮件经过验证
标题
引用次数
引用次数
年份
Towards highly efficient DGEMM on the emerging SW26010 many-core processor
L Jiang, C Yang, Y Ao, W Yin, W Ma, Q Sun, F Liu, R Lin, P Zhang
2017 46th International Conference on Parallel Processing (ICPP), 422-431, 2017
492017
Performance optimization of the HPCG benchmark on the Sunway TaihuLight supercomputer
Y Ao, C Yang, F Liu, W Yin, L Jiang, Q Sun
ACM Transactions on Architecture and Code Optimization (TACO) 15 (1), 1-20, 2018
372018
Enabling highly efficient batched matrix multiplications on SW26010 many-core processor
L Jiang, C Yang, W Ma
ACM Transactions on Architecture and Code Optimization (TACO) 17 (1), 1-23, 2020
152020
xMath2. 0: a high-performance extended math library for SW26010-Pro many-core processor
F Liu, W Ma, Y Zhao, D Chen, Y Hu, Q Lu, WW Yin, X Yuan, L Jiang, ...
CCF Transactions on High Performance Computing 5 (1), 56-71, 2023
102023
Efficient Training of Large Language Models on Distributed Infrastructures: A Survey
J Duan, S Zhang, Z Wang, L Jiang, W Qu, Q Hu, G Wang, Q Weng, H Yan, ...
arXiv preprint arXiv:2407.20018, 2024
72024
EasyView: Enabling and Scheduling Tensor Views in Deep Learning Compilers
L Jiang, P Xu, Q Zhu, X Li, S Yan, X Zhang, D Lin, W Ma, Z Li, J Liu, J Ma, ...
Proceedings of the 51st International Conference on Parallel Processing, 1-11, 2022
22022
A Holistic Functionalization Approach to Optimizing Imperative Tensor Programs in Deep Learning
J Ma, X Li, Z Wang, X Zhang, S Yan, Y Chen, Y Zhang, M Jin, L Jiang, ...
Proceedings of the 61st ACM/IEEE Design Automation Conference, 1-6, 2024
2024
ZeroPP: Unleashing Exceptional Parallelism Efficiencythrough Tensor-Parallelism-Free Methodology
D Tang, L Jiang, J Zhou, M Jin, H Li, X Zhang, Z Pei, J Zhai
arXiv preprint arXiv:2402.03791, 2024
2024
LongTail-Bench: A Benchmark Suite for Domain-Specific Operators in Deep Learning
X Li, S Yan, L Jiang, P Xu, J Ma, X Zhang, D Lin
2022 IEEE International Symposium on Workload Characterization (IISWC), 282-295, 2022
2022
系统目前无法执行此操作,请稍后再试。
文章 1–9