Internal consistency and self-feedback in large language models: A survey X Liang, S Song, Z Zheng, H Wang, Q Yu, X Li, RH Li, Y Wang, Z Wang, ... arXiv preprint arXiv:2407.14507, 2024 | 25 | 2024 |
xFinder: Robust and Pinpoint Answer Extraction for Large Language Models Q Yu, Z Zheng, S Song, Z Li, F Xiong, B Tang, D Chen ICLR 2025, 2024 | 9 | 2024 |
Grimoire is All You Need for Enhancing Large Language Models D Chen, S Song, Q Yu, Z Li, W Wang, F Xiong, B Tang arXiv preprint arXiv:2401.03385, 2024 | 3 | 2024 |
TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles Q Yu, S Song, K Fang, Y Shi, Z Zheng, H Wang, S Niu, Z Li arXiv preprint arXiv:2410.05262, 2024 | | 2024 |