Large language models for software engineering: A systematic literature review

X Hou, Y Zhao, Y Liu, Z Yang, K Wang, L Li… - ACM Transactions on …, 2024 - dl.acm.org
Large Language Models (LLMs) have significantly impacted numerous domains, including
Software Engineering (SE). Many recent publications have explored LLMs applied to …

A survey on large language models for software engineering

Q Zhang, C Fang, Y Xie, Y Zhang, Y Yang… - arXiv preprint arXiv …, 2023 - arxiv.org
Software Engineering (SE) is the systematic design, development, maintenance, and
management of software applications underpinning the digital infrastructure of our modern …

Self-taught optimizer (STOP): Recursively self-improving code generation

E Zelikman, E Lorch, L Mackey… - First Conference on …, 2024 - openreview.net
Several recent advances in AI systems solve problems by providing a "scaffolding" program
that structures multiple calls to language models to generate better outputs. A scaffolding …

A Comprehensive Survey of Benchmarks for Improvement of Software's Non-Functional Properties

A Blot, J Petke - ACM Computing Surveys, 2025 - dl.acm.org
Despite recent increase in research on improvement of non-functional properties of
software, such as energy usage or program size, there is a lack of standard benchmarks for …

Evaluating language models for efficient code generation

J Liu, S Xie, J Wang, Y Wei, Y Ding, L Zhang - arXiv preprint arXiv …, 2024 - arxiv.org
We introduce Differential Performance Evaluation (DPE), a framework designed to reliably
evaluate Large Language Models (LLMs) for efficient code generation. Traditional coding …

Large language model-based agents for software engineering: A survey

J Liu, K Wang, Y Chen, X Peng, Z Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
The recent advance in Large Language Models (LLMs) has shaped a new paradigm of AI
agents, i.e., LLM-based agents. Compared to standalone LLMs, LLM-based agents …

Can Language Models Solve Olympiad Programming?

Q Shi, M Tang, K Narasimhan, S Yao - arXiv preprint arXiv:2404.10952, 2024 - arxiv.org
Computing olympiads contain some of the most challenging problems for humans, requiring
complex algorithmic reasoning, puzzle solving, in addition to generating efficient code …

A systematic assessment of OpenAI o1-preview for higher order thinking in education

E Latif, Y Zhou, S Guo, Y Gao, L Shi… - arXiv preprint arXiv …, 2024 - arxiv.org
As artificial intelligence (AI) continues to advance, it demonstrates capabilities comparable
to human intelligence, with significant potential to transform education and workforce …

Learning to refine with fine-grained natural language feedback

M Wadhwa, X Zhao, JJ Li, G Durrett - arXiv preprint arXiv:2407.02397, 2024 - arxiv.org
Recent work has explored the capability of large language models (LLMs) to identify and
correct errors in LLM-generated responses. These refinement approaches frequently …

CodeV: Empowering LLMs for Verilog generation through multi-level summarization

Y Zhao, D Huang, C Li, P Jin, Z Nan, T Ma, L Qi… - arXiv preprint arXiv …, 2024 - arxiv.org
The increasing complexity and high costs associated with modern processor design have
led to a surge in demand for processor design automation. Instruction-tuned large language …