A survey on deep learning for software engineering
In 2006, Geoffrey Hinton proposed the concept of training “Deep Neural Networks (DNNs)”
and an improved model training method to break the bottleneck of neural network …
and an improved model training method to break the bottleneck of neural network …
A systematic literature review on the use of deep learning in software engineering research
An increasingly popular set of techniques adopted by software engineering (SE)
researchers to automate development tasks are those rooted in the concept of Deep …
researchers to automate development tasks are those rooted in the concept of Deep …
Codexglue: A machine learning benchmark dataset for code understanding and generation
Benchmark datasets have a significant impact on accelerating research in programming
language tasks. In this paper, we introduce CodeXGLUE, a benchmark dataset to foster …
language tasks. In this paper, we introduce CodeXGLUE, a benchmark dataset to foster …
Codenet: A large-scale ai for code dataset for learning a diversity of coding tasks
Over the last several decades, software has been woven into the fabric of every aspect of
our society. As software development surges and code infrastructure of enterprise …
our society. As software development surges and code infrastructure of enterprise …
An empirical comparison of pre-trained models of source code
While a large number of pre-trained models of source code have been successfully
developed and applied to a variety of software engineering (SE) tasks in recent years, our …
developed and applied to a variety of software engineering (SE) tasks in recent years, our …
Palmtree: Learning an assembly language model for instruction embedding
Deep learning has demonstrated its strengths in numerous binary analysis tasks, including
function boundary detection, binary code search, function prototype inference, value set …
function boundary detection, binary code search, function prototype inference, value set …
Programl: A graph-based program representation for data flow analysis and compiler optimizations
Abstract Machine learning (ML) is increasingly seen as a viable approach for building
compiler optimization heuristics, but many ML methods cannot replicate even the simplest of …
compiler optimization heuristics, but many ML methods cannot replicate even the simplest of …
A survey on machine learning techniques for source code analysis
The advancements in machine learning techniques have encouraged researchers to apply
these techniques to a myriad of software engineering tasks that use source code analysis …
these techniques to a myriad of software engineering tasks that use source code analysis …
Flow2vec: Value-flow-based precise code embedding
Code embedding, as an emerging paradigm for source code analysis, has attracted much
attention over the past few years. It aims to represent code semantics through distributed …
attention over the past few years. It aims to represent code semantics through distributed …
Contrastive code representation learning
Recent work learns contextual representations of source code by reconstructing tokens from
their context. For downstream semantic understanding tasks like summarizing code in …
their context. For downstream semantic understanding tasks like summarizing code in …