Qwen2.5 Technical Report A Yang, B Yang, B Zhang, B Hui, B Zheng, B Yu, C Li, D Liu, F Huang, ... Technical Report, 2024 | 902* | 2024 |
Qwen2.5-Coder Technical Report B Hui, J Yang, Z Cui, J Yang, D Liu, L Zhang, T Liu, J Zhang, B Yu, ... Technical Report, 2024 | 101* | 2024 |
One shot learning as instruction data prospector for large language models Y Li, B Hui, X Xia, J Yang, M Yang, L Zhang, S Si, J Liu, T Liu, F Huang, ... ACL'24, 2023 | 42* | 2023 |
Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA M Wang, L Chen, C Fu, S Liao, X Zhang, B Wu, H Yu, N Xu, L Zhang, ... EMNLP'24, 2024 | 26 | 2024 |
Deem: Diffusion models serve as the eyes of large language models for image perception R Luo, Y Li, L Chen, W He, TE Lin, Z Liu, L Zhang, Z Song, X Xia, T Liu, ... ICLR'25, 2024 | 12 | 2024 |
Lifelong language learning with adaptive uncertainty regularization L Zhang, S Wang, F Yuan, B Geng, M Yang Information Sciences 622, 794-807, 2023 | 9 | 2023 |
Marathon: A Race Through the Realm of Long Context with Large Language Models L Zhang, Y Li, Z Liu, J Liu, M Yang ACL'24, 2023 | 7 | 2023 |
Image-text retrieval via contrastive learning with auxiliary generative features and support-set regularization L Zhang, M Yang, C Li, R Xu SIGIR'22, 1938-1943, 2022 | 5 | 2022 |
Evaluating and aligning codellms on human preference J Yang, J Yang, K Jin, Y Miao, L Zhang, L Yang, Z Cui, Y Zhang, B Hui, ... arXiv preprint arXiv:2412.05210, 2024 | 3 | 2024 |
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey L Chen, Z Wang, S Ren, L Li, H Zhao, Y Li, Z Cai, H Guo, L Zhang, ... arXiv preprint arXiv:2412.18619, 2024 | 2 | 2024 |
Hierarchical Context Pruning: Optimizing Real-World Code Completion with Repository-Level Pretrained Code LLMs L Zhang, Y Li, J Li, X Xia, J Yang, R Luo, M Wang, L Chen, J Liu, M Yang AAAI'25, 2024 | 2 | 2024 |
ExecRepoBench: Multi-level Executable Code Completion Evaluation J Yang, J Zhang, J Yang, K Jin, L Zhang, Q Peng, K Deng, Y Miao, T Liu, ... arXiv preprint arXiv:2412.11990, 2024 | 1 | 2024 |
Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models J Li, L Zhang, Y Li, Z Liu, R Luo, L Chen, M Yang EMNLP'24 Findings, 2024 | 1 | 2024 |
OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis R Luo, TE Lin, H Zhang, Y Wu, X Liu, M Yang, Y Li, L Chen, J Li, L Zhang, ... arXiv preprint arXiv:2501.04561, 2025 | | 2025 |