A survey on deep learning for software engineering
In 2006, Geoffrey Hinton proposed the concept of training “Deep Neural Networks (DNNs)”
and an improved model training method to break the bottleneck of neural network …
and an improved model training method to break the bottleneck of neural network …
A survey of machine learning for big code and naturalness
Research at the intersection of machine learning, programming languages, and software
engineering has recently taken important steps in proposing learnable probabilistic models …
engineering has recently taken important steps in proposing learnable probabilistic models …
Lever: Learning to verify language-to-code generation with execution
The advent of large language models trained on code (code LLMs) has led to significant
progress in language-to-code generation. State-of-the-art approaches in this area combine …
progress in language-to-code generation. State-of-the-art approaches in this area combine …
Self-planning code generation with large language models
Although large language models (LLMs) have demonstrated impressive ability in code
generation, they are still struggling to address the complicated intent provided by humans. It …
generation, they are still struggling to address the complicated intent provided by humans. It …
Ai-generated content (aigc): A survey
J Wu, W Gan, Z Chen, S Wan, H Lin - arxiv preprint arxiv:2304.06632, 2023 - arxiv.org
To address the challenges of digital intelligence in the digital economy, artificial intelligence-
generated content (AIGC) has emerged. AIGC uses artificial intelligence to assist or replace …
generated content (AIGC) has emerged. AIGC uses artificial intelligence to assist or replace …
Graphcodebert: Pre-training code representations with data flow
Pre-trained models for programming language have achieved dramatic empirical
improvements on a variety of code-related tasks such as code search, code completion …
improvements on a variety of code-related tasks such as code search, code completion …
Codebleu: a method for automatic evaluation of code synthesis
Evaluation metrics play a vital role in the growth of an area as it defines the standard of
distinguishing between good and bad models. In the area of code synthesis, the commonly …
distinguishing between good and bad models. In the area of code synthesis, the commonly …
A syntax-guided edit decoder for neural program repair
Automated Program Repair (APR) helps improve the efficiency of software development and
maintenance. Recent APR techniques use deep learning, particularly the encoder-decoder …
maintenance. Recent APR techniques use deep learning, particularly the encoder-decoder …
Spider: A large-scale human-labeled dataset for complex and cross-domain semantic parsing and text-to-sql task
We present Spider, a large-scale, complex and cross-domain semantic parsing and text-to-
SQL dataset annotated by 11 college students. It consists of 10,181 questions and 5,693 …
SQL dataset annotated by 11 college students. It consists of 10,181 questions and 5,693 …
code2seq: Generating sequences from structured representations of code
The ability to generate natural language sequences from source code snippets has a variety
of applications such as code summarization, documentation, and retrieval. Sequence-to …
of applications such as code summarization, documentation, and retrieval. Sequence-to …