Beyond Limited Data: Self-play LLM Theorem Provers with Iterative Conjecturing and Proving

K Dong, T Ma - arxiv preprint arxiv:2502.00212, 2025 - arxiv.org
A fundamental challenge in formal theorem proving by LLMs is the lack of high-quality
training data. Although reinforcement learning or expert iteration partially mitigates this issue …