Title | Authors | Venue | Cited by | Year |
Benchmarking foundation models with language-model-as-an-examiner | Y Bai*, J Ying*, Y Cao, X Lv, Y He, X Wang, J Yu, K Zeng, Y Xiao, H Lyu, ... | Advances in Neural Information Processing Systems 36, 2024 | 116 | 2024 |
LLMs-as-Instructors: Learning from Errors Toward Automating Model Improvement | J Ying, M Lin, Y Cao, W Tang, B Wang, Q Sun, X Huang, S Yan | EMNLP 2024 (Findings), 2024 | 8 | 2024 |
Intuitive or Dependent? Investigating LLMs' Robustness to Conflicting Prompts | J Ying, Y Cao, K Xiong, Y He, L Cui, Y Liu | ACL 2024, 2023 | 7 | 2023 |
Automating dataset updates towards reliable and timely evaluation of large language models | J Ying, Y Cao, Y Bai, Q Sun, B Wang, W Tang, Z Ding, Y Yang, X Huang, ... | The Thirty-eight Conference on Neural Information Processing Systems …, 2024 | 6* | 2024 |
A + B: A General Generator-Reader Framework for Optimizing LLMs to Unleash Synergy Potential | W Tang, Y Cao, J Ying, B Wang, Y Zhao, Y Liao, P Zhou | ACL 2024, 2024 | 2 | 2024 |
EvoWiki: Evaluating LLMs on Evolving Knowledge | W Tang, Y Cao, Y Deng, J Ying, B Wang, Y Yang, Y Zhao, Q Zhang, ... | arXiv preprint arXiv:2412.13582, 2024 | 1 | 2024 |
Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning | K Xiong, X Ding, L Du, J Ying, T Liu, B Qin, Y Cao | arXiv preprint arXiv:2408.11431, 2024 | 1 | 2024 |
QRMeM: Unleash the Length Limitation through Question then Reflection Memory Mechanism | B Wang, H Huang, Y Cao, J Ying, W Tang, C Feng | EMNLP 2024 (Findings), 2024 | | 2024 |