Ying Sheng

อ้างโดย

	ทั้งหมด	ตั้งแต่ปี 2020
การอ้างอิง	9832	9796
ดัชนี h	18	18
ดัชนี i10	21	21

7000

3500

1750

5250

202220232024202585 1438 6915 1285

การเข้าถึงแบบสาธารณะ

ดูทั้งหมด

10 บทความ

1 บทความ

ใช้งานได้

ใช้ไม่ได้

อิงตามข้อกำหนดในการรับเงินสนับสนุน

ผู้เขียนร่วม

Lianmin ZhengxAIยืนยันอีเมลแล้วที่ x.ai
Ion StoicaProfessor of Computer Science, UC Berkeleyยืนยันอีเมลแล้วที่ cs.berkeley.edu
Joseph E. GonzalezProfessor of Computer Science, UC Berkeleyยืนยันอีเมลแล้วที่ berkeley.edu
Clark BarrettStanford Universityยืนยันอีเมลแล้วที่ cs.stanford.edu
Yoni ZoharBar Ilan Universityยืนยันอีเมลแล้วที่ biu.ac.il
Beidi ChenCarnegie Mellon Universityยืนยันอีเมลแล้วที่ andrew.cmu.edu

ติดตาม

Ying Sheng

xAI

ยืนยันอีเมลแล้วที่ x.ai - หน้าแรก

Large Language Models Machine Learning Systems Formal Verification


ชื่อ เรียงตามการอ้างอิง เรียงตามปี เรียงตามชื่อ	อ้างโดย อ้างโดย	ปี
Judging llm-as-a-judge with mt-bench and chatbot arena L Zheng, WL Chiang, Y Sheng, S Zhuang, Z Wu, Y Zhuang, Z Lin, Z Li, ... Advances in Neural Information Processing Systems 36, 46595-46623, 2023	3269*	2023
Vicuna: An open-source chatbot impressing gpt-4 with 90%* chatgpt quality WL Chiang, Z Li, Z Lin, Y Sheng, Z Wu, H Zhang, L Zheng, S Zhuang, ... See https://vicuna. lmsys. org (accessed 14 April 2023) 2 (3), 6, 2023	2640*	2023
Efficient memory management for large language model serving with pagedattention W Kwon, Z Li, S Zhuang, Y Sheng, L Zheng, CH Yu, J Gonzalez, H Zhang, ... Proceedings of the 29th Symposium on Operating Systems Principles, 611-626, 2023	1380	2023
cvc5: A versatile and industrial-strength SMT solver H Barbosa, C Barrett, M Brain, G Kremer, H Lachnitt, M Mann, ... International Conference on Tools and Algorithms for the Construction and …, 2022	530	2022
Chatbot arena: An open platform for evaluating llms by human preference WL Chiang, L Zheng, Y Sheng, AN Angelopoulos, T Li, D Li, B Zhu, ... Forty-first International Conference on Machine Learning, 2024	425	2024
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU Y Sheng, L Zheng, B Yuan, Z Li, M Ryabinin, B Chen, P Liang, C Re, ... International Conference on Machine Learning, 2023	360	2023
H2o: Heavy-hitter oracle for efficient generative inference of large language models Z Zhang, Y Sheng, T Zhou, T Chen, L Zheng, R Cai, Z Song, Y Tian, C Ré, ... Advances in Neural Information Processing Systems 36, 34661-34710, 2023	317	2023
How long can context length of open-source llms truly promise? D Li, R Shao, A Xie, Y Sheng, L Zheng, J Gonzalez, I Stoica, X Ma, ... NeurIPS 2023 Workshop on Instruction Tuning and Instruction Following, 2023	152*	2023
{AlpaServe}: Statistical multiplexing with model parallelism for deep learning serving Z Li, L Zheng, Y Zhong, V Liu, Y Sheng, X Jin, Y Huang, Z Chen, H Zhang, ... 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2023	140	2023
Lmsys-chat-1m: A large-scale real-world llm conversation dataset L Zheng, WL Chiang, Y Sheng, T Li, S Zhuang, Z Wu, Y Zhuang, Z Li, ... arXiv preprint arXiv:2309.11998, 2023	131	2023
Sglang: Efficient execution of structured language model programs L Zheng, L Yin, Z Xie, CL Sun, J Huang, CH Yu, S Cao, C Kozyrakis, ... Advances in Neural Information Processing Systems 37, 62557-62583, 2025	115*	2025
Slora: Scalable serving of thousands of lora adapters Y Sheng, S Cao, D Li, C Hooper, N Lee, S Yang, C Chou, B Zhu, L Zheng, ... Proceedings of Machine Learning and Systems 6, 296-311, 2024	93*	2024
Fairness in serving large language models Y Sheng, S Cao, D Li, B Zhu, Z Li, D Zhuo, JE Gonzalez, I Stoica 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2024	42	2024
Subspace embedding and linear regression with orlicz norm A Andoni, C Lin, Y Sheng, P Zhong, R Zhong International Conference on Machine Learning, 224-233, 2018	40	2018
Sorry-bench: Systematically evaluating large language model safety refusal behaviors T Xie, X Qi, Y Zeng, Y Huang, UM Sehwag, K Huang, L He, B Wei, D Li, ... arXiv preprint arXiv:2406.14598, 2024	39	2024
Clover: Closed-loop verifiable code generation C Sun, Y Sheng, O Padon, C Barrett AI Verification: First International Symposium, SAIV 2024, Montreal, QC …, 2024	32	2024
Distribution-free junta testing X Chen, Z Liu, RA Servedio, Y Sheng, J Xie STOC 2018, 2018	30*	2018
Towards Optimal Caching and Model Selection for Large Model Inference B Zhu, Y Sheng, L Zheng, C Barrett, M Jordan, J Jiao Advances in Neural Information Processing Systems 36, 59062-59094, 2023	29*	2023
Reasoning about vectors using an SMT theory of sequences Y Sheng, A Nötzli, A Reynolds, Y Zohar, D Dill, W Grieskamp, J Park, ... International Joint Conference on Automated Reasoning, 125-143, 2022	14*	2022
Politeness for the theory of algebraic datatypes Y Sheng, Y Zohar, C Ringeissen, J Lange, P Fontaine, C Barrett International Joint Conference on Automated Reasoning, 238-255, 2020	13*	2020

ระบบไม่สามารถดำเนินการได้ในขณะนี้ โปรดลองใหม่อีกครั้งในภายหลัง

บทความ 1–20

การอ้างอิงต่อปี

การอ้างอิงซ้ำกัน

การอ้างอิงที่รวมเข้าด้วยกัน

เพิ่มผู้เขียนร่วมผู้เขียนร่วม

ติดตาม

อ้างโดย

ผู้เขียนร่วม