Efficient Large Language Models: A Survey Z Wan, X Wang, C Liu, S Alam, Y Zheng, J Liu, Z Qu, S Yan, Y Zhu, ... TMLR 2024, 2023 | 131 | 2023 |
The Internet of Things in the Era of Generative AI: Vision and Challenges X Wang, Z Wan, A Hekmati, M Zong, S Alam, M Zhang, B Krishnamachari IEEE Internet Computing 2024, 2024 | 22* | 2024 |
SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression X Wang, Y Zheng, Z Wan, M Zhang ICLR 2025, 2025 | 20 | 2025 |
MEIT: Multi-modal Electrocardiogram Instruction Tuning on Large Language Models for Report Generation Z Wan, C Liu, X Wang, C Tao, H Shen, Z Peng, J Fu, R Arcucci, H Yao, ... arXiv preprint arXiv:2403.04945, 2024 | 16 | 2024 |
Data Stream Clustering: An In-depth Empirical Study X Wang*, Z Wang*, Z Wu, S Zhang, X Shi, L Lu SIGMOD 2023, 2023 | 9 | 2023 |
D2O: Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models Z Wan, X Wu, Y Zhang, Y Xin, C Tao, Z Zhu, X Wang, S Luo, J Xiong, ... ICLR 2025, 2025 | 8 | 2025 |
Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion H Shen, Z Wan, X Wang, M Zhang ECCV 2024 Workshop on Computational Aspects of Deep Learning (🏆Best Paper Award), 2024 | 2 | 2024 |
MEDA: Dynamic Cache Allocation for Efficient Multimodal Long-Context Inference Z Wan, H Shen, X Wang, C Liu, Z Mai, M Zhang NAACL 2025, 2025 | | 2025 |
SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression X Wang, S Alam, Z Wan, H Shen, M Zhang NAACL 2025, 2025 | | 2025 |
MOStream: A Modular and Self-Optimizing Data Stream Clustering Algorithm X Wang*, Z Wang*, S Zhang ICDM 2024, 2024 | | 2024 |