Ying Sheng
xAI
Verified email at x.ai - Homepage
Title
Cited by
Year
Judging llm-as-a-judge with mt-bench and chatbot arena
L Zheng, WL Chiang, Y Sheng, S Zhuang, Z Wu, Y Zhuang, Z Lin, Z Li, ...
Advances in Neural Information Processing Systems 36, 46595-46623, 2023
Cited by: 3269* | Year: 2023
Vicuna: An open-source chatbot impressing gpt-4 with 90%* chatgpt quality
WL Chiang, Z Li, Z Lin, Y Sheng, Z Wu, H Zhang, L Zheng, S Zhuang, ...
See https://vicuna.lmsys.org (accessed 14 April 2023) 2 (3), 6, 2023
Cited by: 2640* | Year: 2023
Efficient memory management for large language model serving with pagedattention
W Kwon, Z Li, S Zhuang, Y Sheng, L Zheng, CH Yu, J Gonzalez, H Zhang, ...
Proceedings of the 29th Symposium on Operating Systems Principles, 611-626, 2023
Cited by: 1380 | Year: 2023
cvc5: A versatile and industrial-strength SMT solver
H Barbosa, C Barrett, M Brain, G Kremer, H Lachnitt, M Mann, ...
International Conference on Tools and Algorithms for the Construction and …, 2022
Cited by: 530 | Year: 2022
Chatbot arena: An open platform for evaluating llms by human preference
WL Chiang, L Zheng, Y Sheng, AN Angelopoulos, T Li, D Li, B Zhu, ...
Forty-first International Conference on Machine Learning, 2024
Cited by: 425 | Year: 2024
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU
Y Sheng, L Zheng, B Yuan, Z Li, M Ryabinin, B Chen, P Liang, C Re, ...
International Conference on Machine Learning, 2023
Cited by: 360 | Year: 2023
H2o: Heavy-hitter oracle for efficient generative inference of large language models
Z Zhang, Y Sheng, T Zhou, T Chen, L Zheng, R Cai, Z Song, Y Tian, C Ré, ...
Advances in Neural Information Processing Systems 36, 34661-34710, 2023
Cited by: 317 | Year: 2023
How long can context length of open-source llms truly promise?
D Li, R Shao, A Xie, Y Sheng, L Zheng, J Gonzalez, I Stoica, X Ma, ...
NeurIPS 2023 Workshop on Instruction Tuning and Instruction Following, 2023
Cited by: 152* | Year: 2023
AlpaServe: Statistical multiplexing with model parallelism for deep learning serving
Z Li, L Zheng, Y Zhong, V Liu, Y Sheng, X Jin, Y Huang, Z Chen, H Zhang, ...
17th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2023
Cited by: 140 | Year: 2023
Lmsys-chat-1m: A large-scale real-world llm conversation dataset
L Zheng, WL Chiang, Y Sheng, T Li, S Zhuang, Z Wu, Y Zhuang, Z Li, ...
arXiv preprint arXiv:2309.11998, 2023
Cited by: 131 | Year: 2023
Sglang: Efficient execution of structured language model programs
L Zheng, L Yin, Z Xie, CL Sun, J Huang, CH Yu, S Cao, C Kozyrakis, ...
Advances in Neural Information Processing Systems 37, 62557-62583, 2025
Cited by: 115* | Year: 2025
Slora: Scalable serving of thousands of lora adapters
Y Sheng, S Cao, D Li, C Hooper, N Lee, S Yang, C Chou, B Zhu, L Zheng, ...
Proceedings of Machine Learning and Systems 6, 296-311, 2024
Cited by: 93* | Year: 2024
Fairness in serving large language models
Y Sheng, S Cao, D Li, B Zhu, Z Li, D Zhuo, JE Gonzalez, I Stoica
18th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2024
Cited by: 42 | Year: 2024
Subspace embedding and linear regression with orlicz norm
A Andoni, C Lin, Y Sheng, P Zhong, R Zhong
International Conference on Machine Learning, 224-233, 2018
Cited by: 40 | Year: 2018
Sorry-bench: Systematically evaluating large language model safety refusal behaviors
T Xie, X Qi, Y Zeng, Y Huang, UM Sehwag, K Huang, L He, B Wei, D Li, ...
arXiv preprint arXiv:2406.14598, 2024
Cited by: 39 | Year: 2024
Clover: Closed-loop verifiable code generation
C Sun, Y Sheng, O Padon, C Barrett
AI Verification: First International Symposium, SAIV 2024, Montreal, QC …, 2024
Cited by: 32 | Year: 2024
Distribution-free junta testing
X Chen, Z Liu, RA Servedio, Y Sheng, J Xie
STOC 2018, 2018
Cited by: 30* | Year: 2018
Towards Optimal Caching and Model Selection for Large Model Inference
B Zhu, Y Sheng, L Zheng, C Barrett, M Jordan, J Jiao
Advances in Neural Information Processing Systems 36, 59062-59094, 2023
Cited by: 29* | Year: 2023
Reasoning about vectors using an SMT theory of sequences
Y Sheng, A Nötzli, A Reynolds, Y Zohar, D Dill, W Grieskamp, J Park, ...
International Joint Conference on Automated Reasoning, 125-143, 2022
Cited by: 14* | Year: 2022
Politeness for the theory of algebraic datatypes
Y Sheng, Y Zohar, C Ringeissen, J Lange, P Fontaine, C Barrett
International Joint Conference on Automated Reasoning, 238-255, 2020
Cited by: 13* | Year: 2020