Follow
Jianbo Hu
Jianbo Hu
Verified email at stu.pku.edu.cn
Title
Cited by
Cited by
Year
DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving
Y Zhong, S Liu, J Chen, J Hu, Y Zhu, X Liu, X Jin, H Zhang
arXiv preprint arXiv:2401.09670, 2024
1072024
{DistServe}: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving
Y Zhong, S Liu, J Chen, J Hu, Y Zhu, X Liu, X Jin, H Zhang
18th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–2