Large language models for software engineering: A systematic literature review

X Hou, Y Zhao, Y Liu, Z Yang, K Wang, L Li… - ACM Transactions on …, 2024 - dl.acm.org
Large Language Models (LLMs) have significantly impacted numerous domains, including
Software Engineering (SE). Many recent publications have explored LLMs applied to …

Towards an understanding of large language models in software engineering tasks

Z Zheng, K Ning, Q Zhong, J Chen, W Chen… - Empirical Software …, 2025 - Springer
Abstract Large Language Models (LLMs) have drawn widespread attention and research
due to their astounding performance in text generation and reasoning tasks. Derivative …

Self-refine: Iterative refinement with self-feedback

A Madaan, N Tandon, P Gupta… - Advances in …, 2024 - proceedings.neurips.cc
Like humans, large language models (LLMs) do not always generate the best output on their
first try. Motivated by how humans refine their written text, we introduce Self-Refine, an …

Wizardcoder: Empowering code large language models with evol-instruct

Z Luo, C Xu, P Zhao, Q Sun, X Geng, W Hu… - arxiv preprint arxiv …, 2023 - arxiv.org
Code Large Language Models (Code LLMs), such as StarCoder, have demonstrated
exceptional performance in code-related tasks. However, most existing models are solely …

Mathematical discoveries from program search with large language models

B Romera-Paredes, M Barekatain, A Novikov, M Balog… - Nature, 2024 - nature.com
Large language models (LLMs) have demonstrated tremendous capabilities in solving
complex tasks, from quantitative reasoning to understanding natural language. However …

Large language models for software engineering: Survey and open problems

A Fan, B Gokkaya, M Harman… - 2023 IEEE/ACM …, 2023 - ieeexplore.ieee.org
This paper provides a survey of the emerging area of Large Language Models (LLMs) for
Software Engineering (SE). It also sets out open research challenges for the application of …

Octopack: Instruction tuning code large language models

N Muennighoff, Q Liu, A Zebaze, Q Zheng… - arxiv preprint arxiv …, 2023 - arxiv.org
Finetuning large language models (LLMs) on instructions leads to vast performance
improvements on natural language tasks. We apply instruction tuning using code …

Large language models are edge-case fuzzers: Testing deep learning libraries via fuzzgpt

Y Deng, CS **a, C Yang, SD Zhang, S Yang… - arxiv preprint arxiv …, 2023 - arxiv.org
Deep Learning (DL) library bugs affect downstream DL applications, emphasizing the need
for reliable systems. Generating valid input programs for fuzzing DL libraries is challenging …

Retroformer: Retrospective large language agents with policy gradient optimization

W Yao, S Heinecke, JC Niebles, Z Liu, Y Feng… - arxiv preprint arxiv …, 2023 - arxiv.org
Recent months have seen the emergence of a powerful new trend in which large language
models (LLMs) are augmented to become autonomous language agents capable of …

Bolaa: Benchmarking and orchestrating llm-augmented autonomous agents

Z Liu, W Yao, J Zhang, L Xue, S Heinecke… - arxiv preprint arxiv …, 2023 - arxiv.org
The massive successes of large language models (LLMs) encourage the emerging
exploration of LLM-augmented Autonomous Agents (LAAs). An LAA is able to generate …