On the "naturalness" of buggy code

B Ray, V Hellendoorn, S Godhane, Z Tu… - Proceedings of the 38th …, 2016 - dl.acm.org
Real software, the kind working programmers produce by the kLOC to solve real-world
problems, tends to be "natural", like speech or natural language; it tends to be highly …

Checking smart contracts with structural code embedding

Z Gao, L Jiang, X Xia, D Lo… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Smart contracts have been increasingly used together with blockchains to automate
financial and business transactions. However, many bugs and vulnerabilities have been …

Codit: Code editing with tree-based neural models

S Chakraborty, Y Ding, M Allamanis… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
The way developers edit day-to-day code tends to be repetitive, often using existing code
elements. Many researchers have tried to automate repetitive code changes by learning …

Automated software transplantation

ET Barr, M Harman, Y Jia, A Marginean… - Proceedings of the 2015 …, 2015 - dl.acm.org
Automated transplantation would open many exciting avenues for software development:
suppose we could autotransplant code from one system into another, entirely unrelated …

CodeEditor: Learning to Edit Source Code with Pre-trained Models

J Li, G Li, Z Li, Z Jin, X Hu, K Zhang, Z Fu - ACM Transactions on …, 2023 - dl.acm.org
Developers often perform repetitive code editing activities (up to 70%) for various reasons
(e.g., code refactoring) during software development. Many deep learning (DL) models have …

How has forking changed in the last 20 years? A study of hard forks on GitHub

S Zhou, B Vasilescu, C Kästner - Proceedings of the ACM/IEEE 42nd …, 2020 - dl.acm.org
The notion of forking has changed with the rise of distributed version control systems and
social coding environments, like GitHub. Traditionally forking refers to splitting off an …

Cross-language code search using static and dynamic analyses

G Mathew, KT Stolee - Proceedings of the 29th ACM Joint Meeting on …, 2021 - dl.acm.org
As code search permeates most activities in software development, code-to-code search
has emerged to support using code as a query and retrieving similar code in the search …

Why aren't regular expressions a lingua franca? An empirical study on the re-use and portability of regular expressions

JC Davis, LG Michael IV, CA Coghlan… - Proceedings of the …, 2019 - dl.acm.org
This paper explores the extent to which regular expressions (regexes) are portable across
programming languages. Many languages offer similar regex syntaxes, and it would be …

What the fork: a study of inefficient and efficient forking practices in social coding

S Zhou, B Vasilescu, C Kästner - Proceedings of the 2019 27th ACM …, 2019 - dl.acm.org
Forking and pull requests have been widely used in open-source communities as a uniform
development and contribution mechanism, giving developers the flexibility to modify their …

A survey on renamings of software entities

G Li, H Liu, AS Nyamawe - ACM Computing Surveys (CSUR), 2020 - dl.acm.org
More than 70% of characters in the source code are used to label identifiers. Consequently,
identifiers are one of the most important sources for program comprehension. Meaningful …