LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models H Jiang, Q Wu, CY Lin, Y Yang, L Qiu Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023 | 206 | 2023 |
LongLLMLingua: Accelerating and enhancing llms in long context scenarios via prompt compression H Jiang, Q Wu, X Luo, D Li, CY Lin, Y Yang, L Qiu ACL 2024, 2023 | 156 | 2023 |
Decomposed Meta-Learning for Few-Shot Named Entity Recognition T Ma, H Jiang, Q Wu, T Zhao, CY Lin 60th Annual Meeting of the Association for Computational Linguistics (ACL …, 2022 | 95 | 2022 |
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression Z Pan, Q Wu, H Jiang, M Xia, X Luo, J Zhang, Q Lin, V Rühle, Y Yang, ... ACL 2024 Findings, 2024 | 68 | 2024 |
Minference 1.0: Accelerating pre-filling for long-context llms via dynamic sparse attention H Jiang, Y Li, C Zhang, Q Wu, X Luo, S Ahn, Z Han, AH Abdi, D Li, CY Lin, ... NeurIPS 2024 (Spotlight), 2024 | 49* | 2024 |
AdvPicker: Effectively Leveraging Unlabeled Data via Adversarial Discriminator for Cross-Lingual NER W Chen, H Jiang, Q Wu, BF Karlsson, Y Guan Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021 | 40 | 2021 |
PIT: Optimization of dynamic sparse deep learning models via permutation invariant transformation N Zheng, H Jiang, Q Zhang, Z Han, L Ma, Y Yang, F Yang, C Zhang, L Qiu, ... Proceedings of the 29th Symposium on Operating Systems Principles, 331-347, 2023 | 25 | 2023 |
ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices C Tang, LL Zhang, H Jiang, J Xu, T Cao, Q Zhang, Y Yang, Z Wang, ... The IEEE International Conference on Computer Vision (ICCV) 2023, 2023 | 23 | 2023 |
Attentive Mask CLIP Y Yang, W Huang, Y Wei, H Peng, X Jiang, H Jiang, F Wei, Y Wang, H Hu, ... The IEEE International Conference on Computer Vision (ICCV) 2023, 2023 | 22 | 2023 |
Retrievalattention: Accelerating long-context llm inference via vector retrieval D Liu, M Chen, B Lu, H Jiang, Z Han, Q Zhang, Q Chen, C Zhang, B Ding, ... arXiv preprint arXiv:2409.10516, 2024 | 19* | 2024 |
Hybrid slm and llm for edge-cloud collaborative inference Z Hao, H Jiang, S Jiang, J Ren, T Cao Proceedings of the Workshop on Edge and Mobile Foundation Models, 36-41, 2024 | 10 | 2024 |
Mitigate position bias in large language models via scaling a single dimension Y Yu, H Jiang, X Luo, Q Wu, CY Lin, D Li, Y Yang, Y Huang, L Qiu arXiv preprint arXiv:2406.02536, 2024 | 9 | 2024 |
Accurate and Structured Pruning for Efficient Automatic Speech Recognition H Jiang, LL Zhang, Y Li, Y Wu, S Cao, T Cao, Y Yang, J Li, M Yang, L Qiu 24rd Interspeech 2023, 2023 | 9 | 2023 |
Multi-Level Knowledge Distillation for Out-of-Distribution Detection in Text Q Wu, H Jiang, H Yin, BF Karlsson, CY Lin 61th Annual Meeting of the Association for Computational Linguistics (ACL 2023), 2022 | 9 | 2022 |
BoningKnife: Joint entity mention detection and typing for nested NER via prior boundary knowledge H Jiang, G Wang, W Chen, C Zhang, BF Karlsson arXiv preprint arXiv:2107.09429, 2021 | 6 | 2021 |
CoLaDa: A Collaborative Label Denoising Framework for Cross-lingual Named Entity Recognition T Ma, Q Wu, H Jiang, BF Karlsson, T Zhao, CY Lin 61th Annual Meeting of the Association for Computational Linguistics (ACL 2023), 2023 | 5 | 2023 |
Decomposed Meta-Learning for Few-Shot Sequence Labeling T Ma, Q Wu, H Jiang, J Lin, BF Karlsson, T Zhao, CY Lin IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 4 | 2024 |
Scbench: A kv cache-centric analysis of long-context methods Y Li, H Jiang, Q Wu, X Luo, S Ahn, C Zhang, AH Abdi, D Li, J Gao, Y Yang, ... ICLR 2025, 2024 | 3 | 2024 |
Position Engineering: Boosting Large Language Models through Positional Information Manipulation Z He, H Jiang, Z Wang, Y Yang, L Qiu, L Qiu EMNLP 2024, 2024 | 3 | 2024 |
End-to-End Word-Level Pronunciation Assessment with MASK Pre-training Y Liang, K Song, S Mao, H Jiang, L Qiu, Y Yang, D Li, L Xu, L Qiu 24rd Interspeech 2023, 2023 | 3 | 2023 |