Mwptoolkit: An open-source framework for deep learning-based math word problem solvers Y Lan, L Wang, Q Zhang, Y Lan, BT Dai, Y Wang, D Zhang, EP Lim Proceedings of the AAAI Conference on Artificial Intelligence 36 (11), 13188 …, 2022 | 44 | 2022 |
NOAHQA: Numerical reasoning with interpretable graph question answering dataset Q Zhang, L Wang, S Yu, S Wang, Y Wang, J Jiang, EP Lim arXiv preprint arXiv:2109.10604, 2021 | 20 | 2021 |
Reviseval: Improving llm-as-a-judge via response-adapted references Q Zhang, Y Wang, T Yu, Y Jiang, C Wu, L Li, Y Wang, X Jiang, L Shang, ... arXiv preprint arXiv:2410.05193, 2024 | 5 | 2024 |
Collaborative Performance Prediction for Large Language Models Q Zhang, F Lyu, X Liu, C Ma arXiv preprint arXiv:2407.01300, 2024 | 3 | 2024 |
Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge Q Zhang, Y Wang, Y Jiang, L Li, C Wu, Y Wang, X Jiang, L Shang, R Tang, ... arXiv preprint arXiv:2502.12501, 2025 | | 2025 |
NILE: Internal Consistency Alignment in Large Language Models M Hu, Q Zhang, Y Wang, B He, H Wang, J Zhou, L Li, Y Wang, C Ma, ... arXiv preprint arXiv:2412.16686, 2024 | | 2024 |