Survey on factuality in large language models: Knowledge, retrieval and domain-specificity C Wang*, X Liu*, Y Yue*, X Tang, T Zhang, C Jiayang, Y Yao, W Gao, ... ACM Computing Surveys (*equal contribution), 2023 | 194 | 2023 |
Evaluating open-qa evaluation C Wang, S Cheng, Q Guo, Y Yue, B Ding, Z Xu, Y Wang, X Hu, Z Zhang, ... NeurIPS 2023, 2024 | 58 | 2024 |
Distilling Instruction-following Abilities of Large Language Models with Task-aware Curriculum Planning Y Yue, C Wang, J Huang, P Wang EMNLP 2024 Findings, 2024 | 1 | 2024 |
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines X Du, Y Yao, K Ma, B Wang, T Zheng, K Zhu, M Liu, Y Liang, X Jin, Z Wei, ... arXiv preprint arXiv:2502.14739, 2025 | | 2025 |
Building a Family of Data Augmentation Models for Low-cost LLM Fine-tuning on the Cloud Y Yue*, C Wang*, J Huang, P Wang COLING 2025 Oral, 2024 | | 2024 |