A survey of deep learning for mathematical reasoning
Mathematical reasoning is a fundamental aspect of human intelligence and is applicable in
various fields, including science, engineering, finance, and everyday life. The development …
various fields, including science, engineering, finance, and everyday life. The development …
HOL Light: A tutorial introduction
J Harrison - International Conference on Formal Methods in …, 1996 - Springer
HOL Light is a new version of the HOL theorem prover. While retaining the reliability and
programmability of earlier versions, it is more elegant, lightweight, powerful and automatic; it …
programmability of earlier versions, it is more elegant, lightweight, powerful and automatic; it …
Solving olympiad geometry without human demonstrations
Proving mathematical theorems at the olympiad level represents a notable milestone in
human-level automated reasoning,,–, owing to their reputed difficulty among the world's best …
human-level automated reasoning,,–, owing to their reputed difficulty among the world's best …
Logic-lm: Empowering large language models with symbolic solvers for faithful logical reasoning
Large Language Models (LLMs) have shown human-like reasoning abilities but still struggle
with complex logical problems. This paper introduces a novel framework, Logic-LM, which …
with complex logical problems. This paper introduces a novel framework, Logic-LM, which …
Draft, sketch, and prove: Guiding formal theorem provers with informal proofs
The formalization of existing mathematical proofs is a notoriously difficult process. Despite
decades of research on automation and proof assistants, writing formal proofs remains …
decades of research on automation and proof assistants, writing formal proofs remains …
A survey of reasoning with foundation models
Reasoning, a crucial ability for complex problem-solving, plays a pivotal role in various real-
world settings such as negotiation, medical diagnosis, and criminal investigation. It serves …
world settings such as negotiation, medical diagnosis, and criminal investigation. It serves …
Putnambench: Evaluating neural theorem-provers on the putnam mathematical competition
We present PutnamBench, a new multi-language benchmark for evaluating the ability of
neural theorem-provers to solve competition mathematics problems. PutnamBench consists …
neural theorem-provers to solve competition mathematics problems. PutnamBench consists …
Lego-prover: Neural theorem proving with growing libraries
Despite the success of large language models (LLMs), the task of theorem proving still
remains one of the hardest reasoning tasks that is far from being fully solved. Prior methods …
remains one of the hardest reasoning tasks that is far from being fully solved. Prior methods …
An orchestrated survey of methodologies for automated software test case generation
Test case generation is among the most labour-intensive tasks in software testing. It also has
a strong impact on the effectiveness and efficiency of software testing. For these reasons, it …
a strong impact on the effectiveness and efficiency of software testing. For these reasons, it …
A survey of neural code intelligence: Paradigms, advances and beyond
Neural Code Intelligence--leveraging deep learning to understand, generate, and optimize
code--holds immense potential for transformative impacts on the whole society. Bridging the …
code--holds immense potential for transformative impacts on the whole society. Bridging the …