Legion: Automatically Pushing the Envelope of {Multi-GPU} System for {Billion-Scale}{GNN} Training J Sun, L Su, Z Shi, W Shen, Z Wang, L Wang, J Zhang, Y Li, W Yu, J Zhou, ...
2023 USENIX Annual Technical Conference (USENIX ATC 23), 165-179, 2023
22 2023 Staleness-Reduction Mini-Batch -Means X Zhu, J Sun, Z He, J Jiang, Z Wang
IEEE Transactions on Neural Networks and Learning Systems, 2023
7 2023 Helios: An Efficient Out-of-core GNN Training System on Terabyte-scale Graphs with In-memory Performance J Sun, M Sun, Z Zhang, J Xie, Z Shi, Z Yang, J Zhang, F Wu, Z Wang
arXiv preprint arXiv:2310.00837, 2023
5 2023 SSiMD: Supporting Six Signed Multiplications in a DSP Block for Low-Precision CNN on FPGAs Q Liu, M Sun, J Sun, L Lu, J Zhao, Z Wang
2023 International Conference on Field Programmable Technology (ICFPT), 161-169, 2023
4 2023 P4SGD: Programmable Switch Enhanced Model-Parallel Training on Generalized Linear Models on Distributed FPGAs H Huang, Y Li, J Sun, X Zhu, J Zhang, L Luo, J Li, Z Wang
IEEE Transactions on Parallel and Distributed Systems 34 (8), 2311-2324, 2023
3 2023 TorchGT: A Holistic System for Large-Scale Graph Transformer Training M Zhang, J Sun, Q Hu, P Sun, Z Wang, Y Wen, T Zhang
SC24: International Conference for High Performance Computing, Networking …, 2024
2024 SparseACC: A Generalized Linear Model Accelerator for Sparse Datasets J Zhang, H Huang, J Sun, JG Luna, O Mutlu, Z Wang
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2023
2023