Tianle Li
Undergraduate Researcher, UC Berkeley
Verified email at berkeley.edu
Title
Cited by
Year
Chatbot arena: An open platform for evaluating llms by human preference
WL Chiang, L Zheng, Y Sheng, AN Angelopoulos, T Li, D Li, H Zhang, ...
arXiv preprint arXiv:2403.04132, 2024
Cited by 383, 2024
Lmsys-chat-1m: A large-scale real-world llm conversation dataset
L Zheng, WL Chiang, Y Sheng, T Li, S Zhuang, Z Wu, Y Zhuang, Z Li, ...
arXiv preprint arXiv:2309.11998, 2023
Cited by 104, 2023
From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline
T Li, WL Chiang, E Frick, L Dunlap, T Wu, B Zhu, JE Gonzalez, I Stoica
arXiv preprint arXiv:2406.11939, 2024
Cited by 71, 2024
From live data to high-quality benchmarks: The arena-hard pipeline
T Li, WL Chiang, E Frick, L Dunlap, B Zhu, JE Gonzalez, I Stoica
April, 2024
Cited by 32*, 2024
SWAG: Storytelling With Action Guidance
J Pei, Z Patel, K El-Refai, T Li
Findings of the Association for Computational Linguistics: EMNLP 2024, 14086 …, 2024
Cited by 5*, 2024
Does style matter? disentangling style and substance in chatbot arena, August 2024a
T Li, A Angelopoulos, WL Chiang
URL https://blog.lmarena.ai/blog/2024/style-control, 0
Cited by 4
Athene-70b: Redefining the boundaries of post-training for open models, July 2024
E Frick, P Jin, T Li, K Ganesan, J Zhang, J Jiao, B Zhu
URL https://huggingface.co/Nexusflow/Athene-70B, 0
Cited by 1
Project MPG: towards a generalized performance benchmark for LLM capabilities
L Spangher, T Li, WF Arnold, N Masiewicki, X Dotiwalla, R Parusmathi, ...
arXiv preprint arXiv:2410.22368, 2024
2024
How to Evaluate Reward Models for RLHF
E Frick, T Li, C Chen, WL Chiang, AN Angelopoulos, J Jiao, B Zhu, ...
arXiv preprint arXiv:2410.14872, 2024
2024