MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMs. Z Zeng, Y Liu, Y Wan, J Li, P Chen, J Dai, Y Yao, R Xu, Z Qi, W Zhao, ... NeurIPS 2024 | 12* | 2024 |
Process-driven autoformalization in Lean 4. J Lu*, Y Wan*, Z Liu, Y Huang, J Xiong, C Liu, J Shen, H Jin, J Zhang, ... arXiv preprint arXiv:2406.01940, 2024 | 6 | 2024 |