Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge X Shen, P Dong, L Lu, Z Kong, Z Li, M Lin, C Wu, Y Wang Proceedings of the AAAI Conference on Artificial Intelligence 38 (17), 18944 …, 2024 | 19 | 2024 |
Packqvit: Faster sub-8-bit vision transformers via full and packed quantization on the mobile P Dong, L Lu, C Wu, C Lyu, G Yuan, H Tang, Y Wang Advances in Neural Information Processing Systems 36, 2024 | 16 | 2024 |
EdgeQAT: Entropy and Distribution Guided Quantization-Aware Training for the Acceleration of Lightweight LLMs on the Edge X Shen, Z Kong, C Yang, Z Han, L Lu, P Dong, C Lyu, C Li, X Guo, Z Shu, ... arXiv preprint arXiv:2402.10787, 2024 | 12 | 2024 |
SADOE: Sequential‐based angle‐Doppler off‐grid estimation with coprime sampling structures for space‐time adaptive processing B Li, L Lu, C Zhou IET Radar, Sonar & Navigation 15 (7), 775-787, 2021 | 7 | 2021 |
Off-grid angle-Doppler estimation for space-time adaptive processing: A sequential approach L Lu, C Zhou, Z Shi, J Chen 2019 IEEE/CIC International Conference on Communications in China (ICCC …, 2019 | 6 | 2019 |
HotaQ: Hardware Oriented Token Adaptive Quantization for Large Language Models X Shen, Z Han, L Lu, Z Kong, P Dong, Z Li, Y Xie, C Wu, M Leeser, P Zhao, ... IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2024 | 2 | 2024 |
All-in-One Tuning and Structural Pruning for Domain-Specific LLMs L Lu, Z Wang, R Bao, M Wang, F Li, Y Wu, W Jiang, J Xu, Y Wang, S Gao arXiv preprint arXiv:2412.14426, 2024 | | 2024 |
Fully Open Source Moxin-7B Technical Report P Zhao, X Shen, Z Kong, Y Shen, SE Chang, T Rupprecht, L Lu, E Nan, ... arXiv preprint arXiv:2412.06845, 2024 | | 2024 |
HybridFlow: Infusing Continuity into Masked Codebook for Extreme Low-Bitrate Image Compression L Lu, Y Xie, W Jiang, W Wang, X Lin, Y Wang Proceedings of the 32nd ACM International Conference on Multimedia, 3010-3018, 2024 | | 2024 |
Digital avatars: framework development and their evaluation T Rupprecht, SE Chang, Y Wu, L Lu, E Nan, C Li, C Lai, Z Li, Z Hu, Y He, ... arXiv preprint arXiv:2408.04068, 2024 | | 2024 |
FasterVD: on acceleration of video diffusion models P Yu, D Luo, T Rupprecht, L Lu, Z Kong, P Zhao, Y Li, O Camps, X Lin, ... Proceedings of the Thirty-Third International Joint Conference on Artificial …, 2024 | | 2024 |