Retrieval-augmented generation for AI-generated content: A survey

P Zhao, H Zhang, Q Yu, Z Wang, Y Geng, F Fu… - arXiv preprint arXiv …, 2024 - arxiv.org
The development of Artificial Intelligence Generated Content (AIGC) has been facilitated by
advancements in model algorithms, scalable foundation model architectures, and the …

Generative software engineering

Y Huang, Y Chen, X Chen, J Chen, R Peng… - arXiv preprint arXiv …, 2024 - arxiv.org
The rapid development of deep learning techniques, improved computational power, and
the availability of vast training data have led to significant advancements in pre-trained …

Evaluating large language models in class-level code generation

X Du, M Liu, K Wang, H Wang, J Liu, Y Chen… - Proceedings of the …, 2024 - dl.acm.org
Recently, many large language models (LLMs) have been proposed, showing advanced
proficiency in code generation. Meanwhile, many efforts have been dedicated to evaluating …

A survey on large language models for software engineering

Q Zhang, C Fang, Y Xie, Y Zhang, Y Yang… - arXiv preprint arXiv …, 2023 - arxiv.org
Software Engineering (SE) is the systematic design, development, maintenance, and
management of software applications underpinning the digital infrastructure of our modern …

On the effectiveness of large language models in domain-specific code generation

X Gu, M Chen, Y Lin, Y Hu, H Zhang, C Wan… - ACM Transactions on …, 2024 - dl.acm.org
Large language models (LLMs) such as ChatGPT have shown remarkable capabilities in
code generation. Despite significant achievements, they rely on enormous training data to …

A catalog of data smells for coding tasks

A Vitale, R Oliveto, S Scalabrino - ACM Transactions on Software …, 2024 - dl.acm.org
Large Language Models (LLMs) are increasingly becoming fundamental in supporting
software developers in coding tasks. The massive datasets used for training LLMs are often …

Compositional API recommendation for library-oriented code generation

Z Ma, S An, B Xie, Z Lin - Proceedings of the 32nd IEEE/ACM …, 2024 - dl.acm.org
Large language models (LLMs) have achieved exceptional performance in code generation.
However, the performance remains unsatisfactory in generating library-oriented code …

Beyond functional correctness: Investigating coding style inconsistencies in large language models

Y Wang, T Jiang, M Liu, J Chen, Z Zheng - arXiv preprint arXiv:2407.00456, 2024 - arxiv.org
Large language models (LLMs) have brought a paradigm shift to the field of code
generation, offering the potential to enhance the software development process. However …

Contextual API completion for unseen repositories using LLMs

N Nashid, T Shabani, P Alian, A Mesbah - arXiv preprint arXiv:2405.04600, 2024 - arxiv.org
Large language models have made substantial progress in addressing diverse code-related
tasks. However, their adoption is hindered by inconsistencies in generating output due to the …

LLM hallucinations in practical code generation: Phenomena, mechanism, and mitigation

Z Zhang, Y Wang, C Wang, J Chen, Z Zheng - arXiv preprint arXiv …, 2024 - arxiv.org
Code generation aims to automatically generate code from input requirements, significantly
enhancing development efficiency. Recent large language models (LLMs) based …