Can Large Language Models Understand Real-World Complex Instructions? Q He, J Zeng, W Huang, L Chen, J Xiao, Q He, X Zhou, J Liang, Y Xiao Proceedings of the AAAI Conference on Artificial Intelligence 38 (16), 18188 …, 2024 | 49 | 2024 |
Xiezhi: An ever-updating benchmark for holistic domain knowledge evaluation Z Gu, X Zhu, H Ye, L Zhang, J Wang, Y Zhu, S Jiang, Z Xiong, Z Li, W Wu, ... Proceedings of the AAAI Conference on Artificial Intelligence 38 (16), 18099 …, 2024 | 48 | 2024 |