Huiqiang Jiang

Cytowane przez

	Wszystkie	Od 2020
Cytowania	768	768
h-indeks	10	10
i10-indeks	11	11

520

260

130

390

202220232024202518 93 519 134

Współautorzy

Lili QiuNAI Fellow, ACM Fellow, IEEE Fellow, Professor, Dept. of Computer Science, The University of TexasZweryfikowany adres z cs.utexas.edu
Yuqing YangMicrosoftZweryfikowany adres z microsoft.com
Qianhui Wu (武千惠)Microsoft ResearchZweryfikowany adres z tsinghua.org.cn
Chin-Yew LinPrincipal Research Manager of Knowledge Computing Group, Microsoft Research AsiaZweryfikowany adres z microsoft.com
Börje F. KarlssonBeijing Academy of Artificial Intelligence (BAAI)Zweryfikowany adres z baai.ac.cn
Yucheng LiUniversity of SurreyZweryfikowany adres z surrey.ac.uk
Chengruidong ZhangResearch SDE, MicrosoftZweryfikowany adres z microsoft.com
Baotong LuMicrosoft ResearchZweryfikowany adres z microsoft.com
Guoxin WangMicrosoftZweryfikowany adres z microsoft.com
Dongsheng LiMicrosoft Research AsiaZweryfikowany adres z microsoft.com

Obserwuj

Huiqiang Jiang

Microsoft Research Asia

Zweryfikowany adres z microsoft.com - Strona główna

Efficient AI LLMs MLSys


Tytuł Sortuj wg cytatów Sortuj wg roku Sortuj wg tytułu	Cytowane przez Cytowane przez	Rok
LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models H Jiang, Q Wu, CY Lin, Y Yang, L Qiu Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023	206	2023
LongLLMLingua: Accelerating and enhancing llms in long context scenarios via prompt compression H Jiang, Q Wu, X Luo, D Li, CY Lin, Y Yang, L Qiu ACL 2024, 2023	156	2023
Decomposed Meta-Learning for Few-Shot Named Entity Recognition T Ma, H Jiang, Q Wu, T Zhao, CY Lin 60th Annual Meeting of the Association for Computational Linguistics (ACL …, 2022	95	2022
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression Z Pan, Q Wu, H Jiang, M Xia, X Luo, J Zhang, Q Lin, V Rühle, Y Yang, ... ACL 2024 Findings, 2024	68	2024
Minference 1.0: Accelerating pre-filling for long-context llms via dynamic sparse attention H Jiang, Y Li, C Zhang, Q Wu, X Luo, S Ahn, Z Han, AH Abdi, D Li, CY Lin, ... NeurIPS 2024 (Spotlight), 2024	49*	2024
AdvPicker: Effectively Leveraging Unlabeled Data via Adversarial Discriminator for Cross-Lingual NER W Chen, H Jiang, Q Wu, BF Karlsson, Y Guan Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021	40	2021
PIT: Optimization of dynamic sparse deep learning models via permutation invariant transformation N Zheng, H Jiang, Q Zhang, Z Han, L Ma, Y Yang, F Yang, C Zhang, L Qiu, ... Proceedings of the 29th Symposium on Operating Systems Principles, 331-347, 2023	25	2023
ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices C Tang, LL Zhang, H Jiang, J Xu, T Cao, Q Zhang, Y Yang, Z Wang, ... The IEEE International Conference on Computer Vision (ICCV) 2023, 2023	23	2023
Attentive Mask CLIP Y Yang, W Huang, Y Wei, H Peng, X Jiang, H Jiang, F Wei, Y Wang, H Hu, ... The IEEE International Conference on Computer Vision (ICCV) 2023, 2023	22	2023
Retrievalattention: Accelerating long-context llm inference via vector retrieval D Liu, M Chen, B Lu, H Jiang, Z Han, Q Zhang, Q Chen, C Zhang, B Ding, ... arXiv preprint arXiv:2409.10516, 2024	19*	2024
Hybrid slm and llm for edge-cloud collaborative inference Z Hao, H Jiang, S Jiang, J Ren, T Cao Proceedings of the Workshop on Edge and Mobile Foundation Models, 36-41, 2024	10	2024
Mitigate position bias in large language models via scaling a single dimension Y Yu, H Jiang, X Luo, Q Wu, CY Lin, D Li, Y Yang, Y Huang, L Qiu arXiv preprint arXiv:2406.02536, 2024	9	2024
Accurate and Structured Pruning for Efficient Automatic Speech Recognition H Jiang, LL Zhang, Y Li, Y Wu, S Cao, T Cao, Y Yang, J Li, M Yang, L Qiu 24rd Interspeech 2023, 2023	9	2023
Multi-Level Knowledge Distillation for Out-of-Distribution Detection in Text Q Wu, H Jiang, H Yin, BF Karlsson, CY Lin 61th Annual Meeting of the Association for Computational Linguistics (ACL 2023), 2022	9	2022
BoningKnife: Joint entity mention detection and typing for nested NER via prior boundary knowledge H Jiang, G Wang, W Chen, C Zhang, BF Karlsson arXiv preprint arXiv:2107.09429, 2021	6	2021
CoLaDa: A Collaborative Label Denoising Framework for Cross-lingual Named Entity Recognition T Ma, Q Wu, H Jiang, BF Karlsson, T Zhao, CY Lin 61th Annual Meeting of the Association for Computational Linguistics (ACL 2023), 2023	5	2023
Decomposed Meta-Learning for Few-Shot Sequence Labeling T Ma, Q Wu, H Jiang, J Lin, BF Karlsson, T Zhao, CY Lin IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024	4	2024
Scbench: A kv cache-centric analysis of long-context methods Y Li, H Jiang, Q Wu, X Luo, S Ahn, C Zhang, AH Abdi, D Li, J Gao, Y Yang, ... ICLR 2025, 2024	3	2024
Position Engineering: Boosting Large Language Models through Positional Information Manipulation Z He, H Jiang, Z Wang, Y Yang, L Qiu, L Qiu EMNLP 2024, 2024	3	2024
End-to-End Word-Level Pronunciation Assessment with MASK Pre-training Y Liang, K Song, S Mao, H Jiang, L Qiu, Y Yang, D Li, L Xu, L Qiu 24rd Interspeech 2023, 2023	3	2023

Nie można teraz wykonać tej operacji. Spróbuj ponownie później.

Prace 1–20

Cytowania rocznie

Powielone cytowania

Scalone cytowania

Dodaj współautorówWspółautorzy

Obserwuj

Cytowane przez

Współautorzy