Follow
Dacheng Li
Dacheng Li
UC Berkeley; NVIDIA
Verified email at berkeley.edu - Homepage
Title
Cited by
Cited by
Year
Judging llm-as-a-judge with mt-bench and chatbot arena
L Zheng, WL Chiang, Y Sheng, S Zhuang, Z Wu, Y Zhuang, Z Lin, Z Li, ...
Advances in Neural Information Processing Systems 36, 46595-46623, 2023
3024*2023
Chatbot arena: An open platform for evaluating llms by human preference
WL Chiang, L Zheng, Y Sheng, AN Angelopoulos, T Li, D Li, H Zhang, ...
ICML 2024, 2024
3772024
How Long Can Context Length of Open-Source LLMs truly Promise?
D Li, R Shao, A Xie, Y Sheng, L Zheng, J Gonzalez, I Stoica, X Ma, ...
NeurIPS 2023 Workshop on Instruction Tuning and Instruction Following, 2023
144*2023
Dual contradistinctive generative autoencoder
G Parmar, D Li, K Lee, Z Tu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
972021
SLoRA: Scalable Serving of Thousands of LoRA Adapters
Y Sheng, S Cao, D Li, C Hooper, N Lee, S Yang, C Chou, B Zhu, L Zheng, ...
Proceedings of Machine Learning and Systems 6, 296-311, 2024
86*2024
Mpcformer: fast, performant and private transformer inference with mpc
D Li, R Shao, H Wang, H Guo, EP Xing, H Zhang
The Eleventh International Conference on Learning Representations, 2022
752022
Fairness in serving large language models
Y Sheng, S Cao, D Li, B Zhu, Z Li, D Zhuo, JE Gonzalez, I Stoica
18th USENIX Symposium on Operating Systems Design and Implementation (OSDI’24), 2023
392023
Vila-u: a unified foundation model integrating visual understanding and generation
Y Wu, Z Zhang, J Chen, H Tang, D Li, Y Fang, L Zhu, E Xie, H Yin, L Yi, ...
arXiv preprint arXiv:2409.04429, 2024
342024
LongVILA: Scaling Long-Context Visual Language Models for Long Videos
F Xue, Y Chen, D Li, Q Hu, L Zhu, X Li, Y Fang, H Tang, S Yang, Z Liu, ...
arXiv preprint arXiv:2408.10188, 2024
292024
Sorry-bench: Systematically evaluating large language model safety refusal behaviors
T Xie, X Qi, Y Zeng, Y Huang, UM Sehwag, K Huang, L He, B Wei, D Li, ...
arXiv preprint arXiv:2406.14598, 2024
282024
Distflashattn: Distributed memory-efficient attention for long-context llms training
D Li, R Shao, A Xie, EP Xing, X Ma, I Stoica, JE Gonzalez, H Zhang
First Conference on Language Modeling, 2024
24*2024
Amp: Automatically finding model parallel strategies with heterogeneity awareness
D Li, H Wang, E Xing, H Zhang
Advances in Neural Information Processing Systems 35, 6630-6639, 2022
202022
NVILA: Efficient frontier visual language models
Z Liu, L Zhu, B Shi, Z Zhang, Y Lou, S Yang, H Xi, S Cao, Y Gu, D Li, X Li, ...
arXiv preprint arXiv:2412.04468, 2024
62024
Does compressing activations help model parallel training?
S Bian, D Li, H Wang, E Xing, S Venkataraman
Proceedings of Machine Learning and Systems 6, 239-252, 2024
42024
MPC-Minimized Secure LLM Inference
D Rathee, D Li, I Stoica, H Zhang, R Popa
arXiv preprint arXiv:2408.03561, 2024
32024
Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
H Xi, S Yang, Y Zhao, C Xu, M Li, X Li, Y Lin, H Cai, J Zhang, D Li, J Chen, ...
arXiv preprint arXiv:2502.01776, 2025
2025
Locality-aware Fair Scheduling in LLM Serving
S Cao, Y Wang, Z Mao, PL Hsu, L Yin, T Xia, D Li, S Liu, Y Zhang, Y Zhou, ...
arXiv preprint arXiv:2501.14312, 2025
2025
The system can't perform the operation now. Try again later.
Articles 1–17