Deep learning for code intelligence: Survey, benchmark and toolkit

Y Wan, Z Bi, Y He, J Zhang, H Zhang, Y Sui… - ACM Computing …, 2024 - dl.acm.org
Code intelligence leverages machine learning techniques to extract knowledge from
extensive code corpora, with the aim of develo** intelligent tools to improve the quality …

Program synthesis with large language models

J Austin, A Odena, M Nye, M Bosma… - arxiv preprint arxiv …, 2021 - arxiv.org
This paper explores the limits of the current generation of large language models for
program synthesis in general purpose programming languages. We evaluate a collection of …

Learning deep semantics for test completion

P Nie, R Banerjee, JJ Li, RJ Mooney… - 2023 IEEE/ACM 45th …, 2023 - ieeexplore.ieee.org
Writing tests is a time-consuming yet essential task during software development. We
propose to leverage recent advances in deep learning for text and code generation to assist …

On the evaluation of neural code summarization

E Shi, Y Wang, L Du, J Chen, S Han, H Zhang… - Proceedings of the 44th …, 2022 - dl.acm.org
Source code summaries are important for program comprehension and maintenance.
However, there are plenty of programs with missing, outdated, or mismatched summaries …

Exploring the capabilities of llms for code change related tasks

L Fan, J Liu, Z Liu, D Lo, X **a, S Li - ACM Transactions on Software …, 2024 - dl.acm.org
Developers deal with code-change-related tasks daily, eg, reviewing code. Pre-trained code
and code-change-oriented models have been adapted to help developers with such tasks …

An exploratory literature study on sharing and energy use of language models for source code

M Hort, A Grishina, L Moonen - 2023 ACM/IEEE International …, 2023 - ieeexplore.ieee.org
Context: Large language models trained on source code can support a variety of software
development tasks, such as code recommendation and program repair. Large amounts of …

Coditt5: Pretraining for source code and natural language editing

J Zhang, S Panthaplackel, P Nie, JJ Li… - Proceedings of the 37th …, 2022 - dl.acm.org
Pretrained language models have been shown to be effective in many software-related
generation tasks; however, they are not well-suited for editing tasks as they are not designed …

CCRep: Learning code change representations via pre-trained code model and query back

Z Liu, Z Tang, X **a, X Yang - 2023 IEEE/ACM 45th …, 2023 - ieeexplore.ieee.org
Representing code changes as numeric feature vectors, ie, code change representations, is
usually an essential step to automate many software engineering tasks related to code …

On the significance of category prediction for code-comment synchronization

Z Yang, JW Keung, X Yu, Y **ao, Z **… - ACM Transactions on …, 2023 - dl.acm.org
Software comments sometimes are not promptly updated in sync when the associated code
is changed. The inconsistency between code and comments may mislead the developers …

Where Are Large Language Models for Code Generation on GitHub?

X Yu, L Liu, X Hu, JW Keung, J Liu, X **a - arxiv preprint arxiv:2406.19544, 2024 - arxiv.org
The increasing use of Large Language Models (LLMs) in software development has
garnered significant attention from researchers assessing the quality of the code they …