Polynet: A pursuit of structural diversity in very deep networks X Zhang, Z Li, C Change Loy, D Lin Proceedings of the IEEE conference on computer vision and pattern …, 2017 | 323 | 2017 |
Internlm2 technical report Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ... arXiv preprint arXiv:2403.17297, 2024 | 215 | 2024 |
Internlm-xcomposer2: Mastering free-form text-image composition and comprehension in vision-language large model X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ... arXiv preprint arXiv:2401.16420, 2024 | 215 | 2024 |
Internlm-xcomposer: A vision-language large model for advanced text-image comprehension and composition P Zhang, X Dong, B Wang, Y Cao, C Xu, L Ouyang, Z Zhao, H Duan, ... arXiv preprint arXiv:2309.15112, 2023 | 184 | 2023 |
Optimizing video object detection via a scale-time lattice K Chen, J Wang, S Yang, X Zhang, Y Xiong, CC Loy, D Lin Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 140 | 2018 |
Internlm-xcomposer2-4khd: A pioneering large vision-language model handling resolutions from 336 pixels to 4k hd X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, S Zhang, H Duan, ... arXiv preprint arXiv:2404.06512, 2024 | 107 | 2024 |
Accelerated training for massive classification via dynamic class selection X Zhang, L Yang, J Yan, D Lin Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018 | 43 | 2018 |
Poly-pc: A polyhedral network for multiple point cloud tasks at once T Xie, S Wang, K Wang, L Yang, Z Jiang, X Zhang, K Dai, R Li, J Cheng Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 20 | 2023 |
Elan: Towards generic and efficient elastic training for deep learning L Xie, J Zhai, B Wu, Y Wang, X Zhang, P Sun, S Yan 2020 IEEE 40th International Conference on Distributed Computing Systems …, 2020 | 20 | 2020 |
SKVQ: Sliding-window Key and Value Cache Quantization for Large Language Models H Duanmu, Z Yuan, X Li, J Duan, X Zhang, D Lin arXiv preprint arXiv:2405.06219, 2024 | 14 | 2024 |
Centauri: Enabling Efficient Scheduling for Communication-Computation Overlap in Large Model Training via Communication Partitioning C Chen, X Li, Q Zhu, J Duan, P Sun, X Zhang, C Yang Proceedings of the 29th ACM International Conference on Architectural …, 2024 | 14 | 2024 |
Mdl-nas: A joint multi-domain learning framework for vision transformer S Wang, T Xie, J Cheng, X Zhang, H Liu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 12 | 2023 |
Efficient training of large language models on distributed infrastructures: a survey J Duan, S Zhang, Z Wang, L Jiang, W Qu, Q Hu, G Wang, Q Weng, H Yan, ... arXiv preprint arXiv:2407.20018, 2024 | 7 | 2024 |
Effects of seed layer thickness and post-annealing process on crystalline quality of β-Ga2O3 films prepared on Si (100) substrate by RF magnetron sputtering W Mi, X Li, Y Ding, D Wang, M Xu, L Xiao, X Zhang, X Chen, B Li, L Luo, ... Vacuum 214, 112235, 2023 | 5 | 2023 |
Delta: Dynamically optimizing gpu memory beyond tensor recomputation Y Tang, C Wang, Y Zhang, Y Liu, X Zhang, L Qiao, Z Lai, D Li arXiv preprint arXiv:2203.15980, 2022 | 5 | 2022 |
Proteus: Simulating the performance of distributed DNN training J Duan, X Li, P Xu, X Zhang, S Yan, Y Liang, D Lin IEEE Transactions on Parallel and Distributed Systems, 2024 | 4 | 2024 |
MuxServe: Flexible Spatial-Temporal Multiplexing for Multiple LLM Serving J Duan, R Lu, H Duanmu, X Li, X Zhang, D Lin, I Stoica, H Zhang Forty-first International Conference on Machine Learning, 0 | 4 | |
PSE-Net: Channel pruning for Convolutional Neural Networks with parallel-subnets estimator S Wang, T Xie, H Liu, X Zhang, J Cheng Neural Networks 174, 106263, 2024 | 2 | 2024 |
MuxServe: Flexible Multiplexing for Efficient Multiple LLM Serving J Duan, R Lu, H Duanmu, X Li, X Zhang, D Lin, I Stoica, H Zhang arXiv preprint arXiv:2404.02015, 2024 | 2 | 2024 |
EasyView: Enabling and Scheduling Tensor Views in Deep Learning Compilers L Jiang, P Xu, Q Zhu, X Li, S Yan, X Zhang, D Lin, W Ma, Z Li, J Liu, J Ma, ... Proceedings of the 51st International Conference on Parallel Processing, 1-11, 2022 | 2 | 2022 |