Designing functionalized graphene-stitched-SiC/fluoropolymer novel composite coating with excellent corrosion resistance and hydrogen diffusion barrier properties S Yuan, K Li, Y Sun, C Cong, Y Liu, D Lin, L Pei, Y Zhu, H Wang Chemical Engineering Journal 472, 144881, 2023 | 17 | 2023 |
OpenKMC: a KMC design for hundred-billion-atom simulation using millions of cores on Sunway Taihulight K Li, H Shang, Y Zhang, S Li, B Wu, D Wang, L Zhang, F Li, D Chen, ... Proceedings of the International Conference for High Performance Computing …, 2019 | 16 | 2019 |
Temporal vectorization for stencils L Yuan, H Cao, Y Zhang, K Li, P Lu, Y Yue Proceedings of the International Conference for High Performance Computing …, 2021 | 12 | 2021 |
Reducing redundancy in data organization and arithmetic calculation for stencil computations K Li, L Yuan, Y Zhang, Y Yue Proceedings of the International Conference for High Performance Computing …, 2021 | 12 | 2021 |
Communication-avoiding for dynamical core of atmospheric general circulation model J Xiao, S Li, B Wu, H Zhang, K Li, E Yao, Y Zhang, G Tan Proceedings of the 47th International Conference on Parallel Processing, 1-10, 2018 | 9 | 2018 |
Convstencil: Transform stencil computation to matrix multiplication on tensor cores Y Chen, K Li, Y Wang, D Bai, L Wang, L Ma, L Yuan, Y Zhang, T Cao, ... Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and …, 2024 | 6 | 2024 |
Agcm-3dlf: accelerating atmospheric general circulation model via 3-d parallelization and leap-format H Cao, L Yuan, H Zhang, Y Zhang, B Wu, K Li, S Li, M Zhang, P Lu, J Xiao IEEE Transactions on Parallel and Distributed Systems 34 (3), 766-780, 2022 | 6 | 2022 |
An efficient vectorization scheme for stencil computation K Li, L Yuan, Y Zhang, Y Yue, H Cao 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2022 | 4 | 2022 |
AutoFlow: Hotspot-aware, dynamic load balancing for distributed stream processing P Lu, Y Yue, L Yuan, Y Zhang International Conference on Algorithms and Architectures for Parallel …, 2021 | 4 | 2021 |
Egpuip: An embedded gpu accelerated library for image processing L Wang, H Jia, Y Zhang, K Li, C Wei 2022 IEEE 24th Int Conf on High Performance Computing & Communications; 8th …, 2022 | 3 | 2022 |
An accurate and efficient large-scale regression method through best friend clustering K Li, L Yuan, Y Zhang, G Chen IEEE Transactions on Parallel and Distributed Systems 33 (11), 3129-3140, 2021 | 3 | 2021 |
Correction to: FastNBL: fast neighbor lists establishment for molecular dynamics simulation based on bitwise operations K Li, S Li, S Huang, Y Chen, Y Zhang The Journal of Supercomputing 75, 8339-8340, 2019 | 2 | 2019 |
LONG EXPOSURE: Accelerating Parameter-Efficient Fine-Tuning for LLMs under Shadowy Sparsity T Wang, K Li, Z Hao, D Bai, J Ren, Y Zhang, T Cao, M Yang SC24: International Conference for High Performance Computing, Networking …, 2024 | 1 | 2024 |
LoRAStencil: Low-Rank Adaptation of Stencil Computation on Tensor Cores Y Zhang, K Li, L Yuan, J Cheng, Y Zhang, T Cao, M Yang SC24: International Conference for High Performance Computing, Networking …, 2024 | 1 | 2024 |
SBoRA: Low-Rank Adaptation with Regional Weight Updates LM Po, Y Liu, H Wu, T Zhang, WY Yu, Z Wang, Z Jiang, K Li arXiv preprint arXiv:2407.05413, 2024 | 1 | 2024 |
VNEC: A Vectorized Non-Empty Column Format for SpMV on CPUs L Wang, H Jia, L Xu, C Wei, K Li, X Jiang, Y Zhang 2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2024 | 1 | 2024 |
LBBGEMM: A Load-balanced Batch GEMM Framework on ARM CPU s C Wei, H Jia, Y Zhang, K Li, L Wang 2022 IEEE 24th Int Conf on High Performance Computing & Communications; 8th …, 2022 | 1 | 2022 |
swMD: Performance Optimizations for Molecular Dynamics Simulation on Sunway Taihulight K Li, S Li, B Wang, Y Chen, Y Zhang 2019 IEEE Intl Conf on Parallel & Distributed Processing with Applications …, 2019 | 1 | 2019 |
Implementation and Optimization of Multidimensional FFT Algorithm on Large-Scale Clusters LI Kun, JIA Haipeng, CAO Ting, Z Yunquan Journal of Frontiers of Computer Science & Technology 11 (6), 863, 2017 | 1 | 2017 |
LeMo: Enabling LEss Token Involvement for MOre Context Fine-tuning T Wang, X Chen, K Li, T Cao, J Ren, Y Zhang arXiv preprint arXiv:2501.09767, 2025 | | 2025 |