DGCL: An efficient communication library for distributed GNN training Z Cai, X Yan, Y Wu, K Ma, J Cheng, F Yu Proceedings of the Sixteenth European Conference on Computer Systems, 130-144, 2021 | 104 | 2021 |
Flexps: Flexible parallelism control in parameter server architecture Y Huang, T Jin, Y Wu, Z Cai, X Yan, F Yang, J Li, Y Guo, J Cheng Proceedings of the VLDB Endowment 11 (5), 566-579, 2018 | 88 | 2018 |
Seastar: vertex-centric programming for graph neural networks Y Wu, K Ma, Z Cai, T Jin, B Li, C Zheng, J Cheng, F Yu Proceedings of the Sixteenth European Conference on Computer Systems, 359-375, 2021 | 61 | 2021 |
Elastic deep learning in multi-tenant GPU clusters Y Wu, K Ma, X Yan, Z Liu, Z Cai, Y Huang, J Cheng, H Yuan, F Yu IEEE Transactions on Parallel and Distributed Systems 33 (1), 144-158, 2021 | 52 | 2021 |
Tensoropt: Exploring the tradeoffs in distributed dnn training with auto-parallelism Z Cai, X Yan, K Ma, Y Wu, Y Huang, J Cheng, T Su, F Yu IEEE Transactions on Parallel and Distributed Systems 33 (8), 1967-1981, 2021 | 42 | 2021 |
Improving resource utilization by timely fine-grained scheduling T Jin, Z Cai, B Li, C Zheng, G Jiang, J Cheng Proceedings of the Fifteenth European Conference on Computer Systems, 1-16, 2020 | 37 | 2020 |
DSP: Efficient GNN training with multiple GPUs Z Cai, Q Zhou, X Yan, D Zheng, X Song, C Zheng, J Cheng, G Karypis Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and …, 2023 | 27 | 2023 |
Dgi: An easy and efficient framework for gnn model evaluation P Yin, X Yan, J Zhou, Q Fu, Z Cai, J Cheng, B Tang, M Wang Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023 | 10 | 2023 |
Scalable de novo genome assembly using pregel D Yan, H Chen, J Cheng, Z Cai, B Shao 2018 IEEE 34th International Conference on Data Engineering (ICDE), 1216-1219, 2018 | 10 | 2018 |
gSampler: General and efficient GPU-based graph sampling for graph learning P Gong, R Liu, Z Mao, Z Cai, X Yan, C Li, M Wang, Z Li Proceedings of the 29th Symposium on Operating Systems Principles, 562-578, 2023 | 5 | 2023 |
MuseGNN: Interpretable and convergent graph neural network layers at scale H Jiang, R Liu, X Yan, Z Cai, M Wang, D Wipf arXiv preprint arXiv:2310.12457, 2023 | 4 | 2023 |
FEC: Efficient Deep Recommendation Model Training with Flexible Embedding Communication K Ma, X Yan, Z Cai, Y Huang, Y Wu, J Cheng Proceedings of the ACM on Management of Data 1 (2), 1-21, 2023 | 4 | 2023 |
4DBInfer: A 4D Benchmarking Toolbox for Graph-Centric Predictive Modeling on Relational DBs M Wang, Q Gan, D Wipf, Z Cai, N Li, J Tang, Y Zhang, Z Zhang, Z Mao, ... arXiv preprint arXiv:2404.18209, 2024 | 3 | 2024 |
DGI: Easy and Efficient Inference for GNNs P Yin, X Yan, J Zhou, Q Fu, Z Cai, J Cheng, B Tang, M Wang arXiv preprint arXiv:2211.15082, 2022 | 2 | 2022 |
DiskGNN: Bridging I/O Efficiency and Model Accuracy for Out-of-Core GNN Training R Liu, Y Wang, X Yan, Z Cai, M Wang, H Jiang, B Tang, J Li arXiv preprint arXiv:2405.05231, 2024 | 1 | 2024 |
DF-GNN: Dynamic Fusion Framework for Attention Graph Neural Networks on GPUs J Liu, Z Cai, Z Chen, M Wang arXiv preprint arXiv:2411.16127, 2024 | | 2024 |
PPS: Fair and efficient black-box scheduling for multi-tenant GPU clusters K Ma, Z Cai, X Yan, Y Zhang, Z Liu, Y Feng, C Li, W Lin, J Cheng Parallel Computing 120, 103082, 2024 | | 2024 |
Towards Efficient Training for Large-Scale Deep Learning Models Z Cai PQDT-Global, 2022 | | 2022 |
DGCL Z Cai, X Yan, Y Wu, K Ma, J Cheng, F Yu Proceedings of the Sixteenth European Conference on Computer Systems, 2021 | | 2021 |
4DBInfer: A 4D Benchmarking Toolbox for Graph-Centric Predictive Modeling on RDBs M Wang, Q Gan, D Wipf, Z Zhang, C Faloutsos, W Zhang, M Zhang, Z Cai, ... The Thirty-eight Conference on Neural Information Processing Systems …, 0 | | |