Прати
Yuhan Liu
Yuhan Liu
Верификована је имејл адреса на uchicago.edu - Почетна страница
Наслов
Навело
Навело
Година
Autofreeze: Automatically freezing model blocks to accelerate fine-tuning
Y Liu, S Agarwal, S Venkataraman
arXiv preprint arXiv:2102.01386, 2021
642021
CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving
Y Liu, H Li, Y Cheng, S Ray, Y Huang, Q Zhang, K Du, J Yao, S Lu, ...
Proceedings of the ACM SIGCOMM 2024 Conference, 38-56, 2024
56*2024
Multi defect detection and analysis of electron microscopy images with deep learning
M Shen, G Li, D Wu, Y Liu, JRC Greaves, W Hao, NJ Krakauer, L Krudy, ...
Computational Materials Science 199, 110576, 2021
512021
Performance and limitations of deep learning semantic segmentation of multiple defects in transmission electron micrographs
R Jacobs, M Shen, Y Liu, W Hao, X Li, R He, JRC Greaves, D Wang, Z Xie, ...
Cell Reports Physical Science 3 (5), 2022
292022
Accelerating deep learning inference via learned caches
A Balasubramanian, A Kumar, Y Liu, H Cao, S Venkataraman, A Akella
arXiv preprint arXiv:2101.07344, 2021
182021
CacheBlend: Fast Large Language Model Serving with Cached Knowledge Fusion
J Yao, H Li, Y Liu, S Ray, Y Cheng, Q Zhang, K Du, S Lu, J Jiang
arXiv preprint arXiv:2405.16444, 2024
16*2024
{GRACE}:{Loss-Resilient}{Real-Time} Video through Neural Codecs
Y Cheng, Z Zhang, H Li, A Arapin, Y Zhang, Q Zhang, Y Liu, K Du, ...
21st USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2024
142024
Run-Time Prevention of Software Integration Failures of Machine Learning APIs
C Wan, Y Liu, K Du, H Hoffmann, J Jiang, M Maire, S Lu
Proceedings of the ACM on Programming Languages 7 (OOPSLA2), 264-291, 2023
22023
DroidSpeak: Enhancing Cross-LLM Communication
Y Liu, E Choukse, S Lu, J Jiang, M Musuvathi
arXiv preprint arXiv:2411.02820, 2024
12024
Keeper: Automated Testing and Fixing of Machine Learning Software
C Wan, S Liu, S Xie, Y Liu, H Hoffmann, M Maire, S Lu
ACM Transactions on Software Engineering and Methodology 33 (7), 1-33, 2024
12024
ChameleonAPI: Automatic and Efficient Customization of Neural Networks for ML Applications
Y Liu, C Wan, K Du, H Hoffmann, J Jiang, S Lu, M Maire
Proceedings of the 18th USENIX Symposium on Operating Systems Design and …, 2024
2024
Chatterbox: Robust Transport for LLM Token Streaming under Unstable Network
H Li, Y Liu, Y Cheng, S Ray, K Du, J Jiang
arXiv preprint arXiv:2401.12961, 2024
2024
OneAdapt: Fast Adaptation for Deep Learning Applications via Backpropagation
K Du, Y Liu, Y Hao, Q Zhang, H Wang, Y Huang, G Ananthanarayanan, ...
Proceedings of the 2023 ACM Symposium on Cloud Computing, 158-176, 2023
2023
Систем тренутно не може да изврши ову радњу. Пробајте поново касније.
Чланци 1–13