Parallel optimization and application of unstructured sparse triangular solver on new generation of sunway architecture J Li, L Li, Q Wang, W Xue, J Liang, J Shi Parallel Computing 120, 103080, 2024 | 3 | 2024 |
FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores J Shi, S Li, Y Xu, R Fu, X Wang, T Wu arXiv preprint arXiv:2412.11007, 2024 | 1 | 2024 |
New YARN sharing GPU based on graphics memory granularity scheduling J Shi, D Chen, J Liang, L Li, Y Lin, J Li Parallel Computing 117, 103038, 2023 | 1 | 2023 |
Toward efficient structured-grid triangular solver on sunway many-core processors J Li, J Liang, W Xue, Z Hu, L Li, J Shi The Journal of Supercomputing 80 (8), 10610-10636, 2024 | | 2024 |