Large language models for software engineering: A systematic literature review

X Hou, Y Zhao, Y Liu, Z Yang, K Wang, L Li… - ACM Transactions on …, 2024 - dl.acm.org
Large Language Models (LLMs) have significantly impacted numerous domains, including
Software Engineering (SE). Many recent publications have explored LLMs applied to …

A survey on large language models for software engineering

Q Zhang, C Fang, Y Xie, Y Zhang, Y Yang… - arXiv preprint arXiv …, 2023 - arxiv.org
Software Engineering (SE) is the systematic design, development, maintenance, and
management of software applications underpinning the digital infrastructure of our modern …

Guiding enumerative program synthesis with large language models

Y Li, J Parsert, E Polgreen - International Conference on Computer Aided …, 2024 - Springer
Pre-trained Large Language Models (LLMs) are beginning to dominate the
discourse around automatic code generation with natural language specifications. In …

Laurel: generating Dafny assertions using large language models

E Mugnier, EA Gonzalez, R Jhala… - arXiv preprint arXiv …, 2024 - arxiv.org
Dafny is a popular verification language, which automates proofs by outsourcing them to an
SMT solver. This automation is not perfect, however, and the solver often requires guidance …

Towards neural synthesis for SMT-assisted proof-oriented programming

S Chakraborty, G Ebner, S Bhat, S Fakhoury… - arXiv preprint arXiv …, 2024 - arxiv.org
Proof-oriented programs mix computational content with proofs of program correctness.
However, the human effort involved in programming and proving is still substantial, despite …

DafnyBench: A benchmark for formal software verification

C Loughridge, Q Sun, S Ahrenbach, F Cassano… - arXiv preprint arXiv …, 2024 - arxiv.org
We introduce DafnyBench, the largest benchmark of its kind for training and evaluating
machine learning systems for formal software verification. We test the ability of LLMs such as …

Agents in software engineering: Survey, landscape, and vision

Y Wang, W Zhong, Y Huang, E Shi, M Yang… - arXiv preprint arXiv …, 2024 - arxiv.org
In recent years, Large Language Models (LLMs) have achieved remarkable success and
have been widely used in various downstream tasks, especially in the tasks of the software …

Formal mathematical reasoning: A new frontier in AI

K Yang, G Poesia, J He, W Li, K Lauter… - arXiv preprint arXiv …, 2024 - arxiv.org
AI for Mathematics (AI4Math) is not only intriguing intellectually but also crucial for AI-driven
discovery in science, engineering, and beyond. Extensive efforts on AI4Math have mirrored …

AutoVerus: Automated proof generation for Rust code

C Yang, X Li, MRH Misu, J Yao, W Cui, Y Gong… - arXiv preprint arXiv …, 2024 - arxiv.org
Generative AI has shown its values for many software engineering tasks. Still in its infancy,
large language model (LLM)-based proof generation lags behind LLM-based code …

Leveraging large language models to boost Dafny's developers productivity

ÁF Silva, A Mendes, JF Ferreira - Proceedings of the 2024 IEEE/ACM …, 2024 - dl.acm.org
This research idea paper proposes leveraging Large Language Models (LLMs) to enhance
the productivity of Dafny developers. Although the use of verification-aware languages, such …