Yang Zhou
Verified email at andrew.cmu.edu
Title
Cited by
Year
LLM inference unveiled: Survey and roofline model insights
Z Yuan, Y Shang, Y Zhou, Z Dong, Z Zhou, C Xue, B Wu, Z Li, Q Gu, ...
arXiv preprint arXiv:2402.16363, 2024
Cited by 67, 2024
MagicPIG: LSH sampling for efficient LLM generation
Z Chen, R Sadhukhan, Z Ye, Y Zhou, J Zhang, N Nolte, Y Tian, M Douze, ...
arXiv preprint arXiv:2410.16179, 2024
Cited by 13, 2024
Ant: Adapt network across time for efficient video processing
F Liang, TW Chin, Y Zhou, D Marculescu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by 4, 2022
HexGen: Generative inference of large language model over heterogeneous environment
Y Jiang, R Yan, X Yao, Y Zhou, B Chen, B Yuan
arXiv preprint arXiv:2311.11514, 2023
Cited by 3, 2023
GSM-Infinite: How Do Your LLMs Behave over Infinitely Increasing Context Length and Reasoning Complexity?
Y Zhou, H Liu, Z Chen, Y Tian, B Chen
arXiv preprint arXiv:2502.05252, 2025
Cited by 2, 2025
SIRIUS: Contextual Sparsity with Correction for Efficient LLMs
Y Zhou, Z Chen, Z Xu, V Lin, B Chen
Advances in Neural Information Processing Systems 37, 24046-24080, 2025
Cited by 2, 2025
Play it cool: Dynamic shifting prevents thermal throttling
Y Zhou, F Liang, T Chin, D Marculescu
arXiv preprint arXiv:2206.10849, 2022
Cited by 2, 2022
DQRM: Deep Quantized Recommendation Models
Y Zhou, Z Dong, E Chan, D Kalamkar, D Marculescu, K Keutzer
arXiv preprint arXiv:2410.20046, 2024
Cited by 1, 2024
Articles 1–8