Vicuna: An open-source chatbot impressing gpt-4 with 90%* chatgpt quality WL Chiang, Z Li, Z Lin, Y Sheng, Z Wu, H Zhang, L Zheng, S Zhuang, ... See https://vicuna. lmsys. org (accessed 14 April 2023) 2 (3), 6, 2023 | 2882* | 2023 |
Judging llm-as-a-judge with mt-bench and chatbot arena L Zheng, WL Chiang, Y Sheng, S Zhuang, Z Wu, Y Zhuang, Z Lin, Z Li, ... Advances in Neural Information Processing Systems 36, 46595-46623, 2023 | 2780 | 2023 |
Alpa: Automating inter-and {Intra-Operator} parallelism for distributed deep learning L Zheng, Z Li, H Zhang, Y Zhuang, Z Chen, Y Huang, Y Wang, Y Xu, ... 16th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2022 | 351 | 2022 |
Lmsys-chat-1m: A large-scale real-world llm conversation dataset L Zheng, WL Chiang, Y Sheng, T Li, S Zhuang, Z Wu, Y Zhuang, Z Li, ... arXiv preprint arXiv:2309.11998, 2023 | 117 | 2023 |
Llm360: Towards fully transparent open-source llms Z Liu, A Qiao, W Neiswanger, H Wang, B Tan, T Tao, J Li, Y Wang, S Sun, ... arXiv preprint arXiv:2312.06550, 2023 | 56 | 2023 |
On optimizing the communication of model parallelism Y Zhuang, L Zheng, Z Li, E Xing, Q Ho, J Gonzalez, I Stoica, H Zhang, ... Proceedings of Machine Learning and Systems 5, 2023 | 37 | 2023 |
Toward Inference-optimal Mixture-of-Expert Large Language Models L Yun, Y Zhuang, Y Fu, EP Xing, H Zhang arXiv preprint arXiv:2404.02852, 2024 | 5 | 2024 |
Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow Y Mei, Y Zhuang, X Miao, J Yang, Z Jia, R Vinayak Proceedings of the 30th ACM International Conference on Architectural …, 2025 | 4* | 2025 |
Mixture of experts enable efficient and effective protein understanding and design N Sun, S Zou, T Tao, S Mahbub, D Li, Y Zhuang, H Wang, X Cheng, ... bioRxiv, 2024.11. 29.625425, 2024 | 3 | 2024 |
LLM360 K2: Building a 65B 360-Open-Source Large Language Model from Scratch Z Liu, B Tan, H Wang, W Neiswanger, T Tao, H Li, F Koto, Y Wang, S Sun, ... arXiv preprint arXiv:2501.07124, 2025 | 1* | 2025 |
Accurate and general dna representations emerge from genome foundation models at scale CN Ellington, N Sun, N Ho, T Tao, S Mahbub, D Li, Y Zhuang, H Wang, ... bioRxiv, 2024.12. 01.625444, 2024 | 1 | 2024 |
RedCoast: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs B Tan, Y Zhu, L Liu, H Wang, Y Zhuang, J Chen, E Xing, Z Hu Proceedings of the 2024 Conference of the North American Chapter of the …, 2024 | 1 | 2024 |
Scaling dense representations for single cell with transcriptome-scale context N Ho, CN Ellington, J Hou, S Addagudi, S Mo, T Tao, D Li, Y Zhuang, ... bioRxiv, 2024.11. 28.625303, 2024 | 1 | 2024 |
A large-scale foundation model for rna function and structure prediction S Zou, T Tao, S Mahbub, CN Ellington, RJ Algayres, D Li, Y Zhuang, ... bioRxiv, 2024.11. 28.625345, 2024 | 1 | 2024 |
Redco: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs B Tan, Y Zhu, L Liu, H Wang, Y Zhuang, J Chen, E Xing, Z Hu arXiv preprint arXiv:2310.16355, 2023 | 1 | 2023 |