Xiaozhe Yao
Verified email at inf.ethz.ch
Title
Cited by
Year
Dataperf: Benchmarks for data-centric ai development
M Mazumder, C Banbury, X Yao, B Karlaš, W Gaviria Rojas, S Diamos, ...
Advances in Neural Information Processing Systems 36, 5320-5347, 2023
Cited by 148*, 2023
Redpajama: an open dataset for training large language models
M Weber, D Fu, Q Anthony, Y Oren, S Adams, A Alexandrov, X Lyu, ...
Advances in Neural Information Processing Systems 37, 116462-116492, 2025
Cited by 21*, 2025
DMLR: Data-centric Machine Learning Research--Past, Present and Future
L Oala, M Maskey, L Bat-Leah, A Parrish, NM Gürel, TS Kuo, Y Liu, R Dror, ...
arXiv preprint arXiv:2311.13028, 2023
Cited by 13, 2023
Deltazip: Multi-tenant language model serving via delta compression
X Yao, Q Hu, A Klimovic
arXiv preprint arXiv:2312.05215, 2023
Cited by 11, 2023
Gpt-zip: Deep compression of finetuned large language models
B Isik, H Kumbong, W Ning, X Yao, S Koyejo, C Zhang
Workshop on Efficient Systems for Foundation Models @ ICML 2023, 2023
Cited by 11, 2023
Aurora-M: Open Source Continual Pre-training for Multilingual Language and Code
T Nakamura, M Mishra, S Tedeschi, Y Chai, JT Stillerman, F Friedrich, ...
arXiv preprint arXiv:2404.00399, 2024
Cited by 8*, 2024
Shift: an efficient, flexible search engine for transfer learning
C Renggli, X Yao, L Kolar, L Rimanic, A Klimovic, C Zhang
arXiv preprint arXiv:2204.01457, 2022
Cited by 4, 2022
Face based advertisement recommendation with deep learning: a case study
X Yao, Y Chen, R Liao, S Cai
Smart Computing and Communication: Second International Conference, SmartCom …, 2018
Cited by 4, 2018
HexGen: Generative Inference of Foundation Model over Heterogeneous Decentralized Environment
Y Jiang, R Yan, X Yao, B Chen, B Yuan
CoRR, 2023
Cited by 3*, 2023
MLPM: Machine Learning Package Manager
X Yao
ACM ISBN, 978-1, 2020
Cited by 3, 2020
Demystifying Cost-Efficiency in LLM Serving over Heterogeneous GPUs
Y Jiang, F Fu, X Yao, G He, X Miao, A Klimovic, B Cui, B Yuan, E Yoneki
arXiv preprint arXiv:2502.00722, 2025
Cited by 2, 2025
Open Compute Framework: Peer-to-Peer Task Queue for Foundation Model Inference Serving, September 2023
X Yao
URL https://github.com/autoai-org/OpenComputeFramework
Cited by 2
Data-centric Machine Learning Research (DMLR): Harnessing Momentum for Science
M Maskey, L Bat-Leah, D Brajovic, P Climaco, A Parrish, C Park, X Yao, ...
ICLR 2024 Workshops, 2024
Cited by 1, 2024
ThunderServe: High-performance and Cost-efficient LLM Serving in Cloud Environments
Y Jiang, F Fu, X Yao, T Wang, B Cui, A Klimovic, E Yoneki
arXiv preprint arXiv:2502.09334, 2025
2025
Decluttering the data mess in LLM training
M Böther, D Graur, X Yao, A Klimovic
2nd Workshop on Hot Topics in System Infrastructure (HotInfra 2024), 2024
2024
ETH Library Lab: Lernen Sie das Innovationslabor der ETH-Bibliothek kennen
M Okonnek
17:15 Kolloquium der ETH-Bibliothek, 2021
2021
Articles 1–16