Resource-efficient algorithms and systems of foundation models: A survey M Xu, D Cai, W Yin, S Wang, X Jin, X Liu ACM Computing Surveys 57 (5), 1-39, 2025 | 104* | 2025 |
EdgeLLM: Fast On-device LLM Inference with Speculative Decoding D Xu, W Yin, H Zhang, X Jin, Y Zhang, S Wei, M Xu, X Liu IEEE Transactions on Mobile Computing, 2024 | 52* | 2024 |
Llm as a system service on mobile devices W Yin, M Xu, Y Li, X Liu arXiv preprint arXiv:2403.11805, 2024 | 44 | 2024 |
Elms: Elasticized large language models on mobile devices W Yin, R Yi, D Xu, G Huang, M Xu, X Liu arXiv preprint arXiv:2409.09071, 2024 | 5 | 2024 |
PieBridge: Fast and Parameter-Efficient On-Device Training via Proxy Networks W Yin, D Xu, G Huang, Y Zhang, S Wei, M Xu, X Liu Proceedings of the 22nd ACM Conference on Embedded Networked Sensor Systems …, 2024 | | 2024 |