Formal mathematical reasoning: A new frontier in ai

K Yang, G Poesia, J He, W Li, K Lauter… - arxiv preprint arxiv …, 2024 - arxiv.org
AI for Mathematics (AI4Math) is not only intriguing intellectually but also crucial for AI-driven
discovery in science, engineering, and beyond. Extensive efforts on AI4Math have mirrored …

LeanAgent: Lifelong Learning for Formal Theorem Proving

A Kumarappan, M Tiwari, P Song, RJ George… - arxiv preprint arxiv …, 2024 - arxiv.org
Large Language Models (LLMs) have been successful in mathematical reasoning tasks
such as formal theorem proving when integrated with interactive proof assistants like Lean …

Formal Theorem Proving by Rewarding LLMs to Decompose Proofs Hierarchically

K Dong, A Mahankali, T Ma - arxiv preprint arxiv:2411.01829, 2024 - arxiv.org
Mathematical theorem proving is an important testbed for large language models' deep and
abstract reasoning capability. This paper focuses on improving LLMs' ability to write proofs …

LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction

S Huang, P Song, RJ George… - arxiv preprint arxiv …, 2025 - arxiv.org
Mathematical reasoning remains a significant challenge for Large Language Models (LLMs)
due to hallucinations. When combined with formal proof assistants like Lean, these …

Diverse Inference and Verification for Advanced Reasoning

I Drori, G Longhitano, M Mao, S Hyun, Y Zhang… - arxiv preprint arxiv …, 2025 - arxiv.org
Reasoning LLMs such as OpenAI o1, o3 and DeepSeek R1 have made significant progress
in mathematics and coding, yet find challenging advanced tasks such as International …

Activation Steering in Neural Theorem Provers

S Kirtania - arxiv preprint arxiv:2502.15507, 2025 - arxiv.org
Large Language Models (LLMs) have shown promise in proving formal theorems using
proof assistants like Lean. However, current state of the art language models struggles to …

CARTS: Advancing Neural Theorem Proving with Diversified Tactic Calibration and Bias-Resistant Tree Search

XW Yang, Z Zhou, H Wang, A Li, WD Wei, H **… - … Conference on Learning … - openreview.net
Recent advancements in neural theorem proving integrate large language models with tree
search algorithms like Monte Carlo Tree Search (MCTS), where the language model …

Language models for verifiable mathematical automation: Interaction, integration, and autoformalization

Q Jiang - 2025 - repository.cam.ac.uk
Stronger automation in formal mathematical reasoning provides scalability, trust-worthiness,
and accessibility: it enables efficient verification of complex proofs, reduces the likelihood of …

NLIR: Natural Language Intermediate Representation for Mechanized Theorem Proving

L Teodorescu, G Baudart, EJG Arias… - MathAI@ NeuRIPS 2024 …, 2024 - hal.science
Formal theorem proving is challenging for humans as well as for machines. Thanks to recent
advances in LLM capabilities, we believe natural language can serve as a universal …