A survey of machine learning for big code and naturalness

M Allamanis, ET Barr, P Devanbu… - ACM Computing Surveys …, 2018 - dl.acm.org
Research at the intersection of machine learning, programming languages, and software
engineering has recently taken important steps in proposing learnable probabilistic models …

Code smells and refactoring: A tertiary systematic review of challenges and observations

G Lacerda, F Petrillo, M Pimenta… - Journal of Systems and …, 2020 - Elsevier
Refactoring and smells have been well researched by the software-engineering research
community these past decades. Several secondary studies have been published on code …

Deep learning code fragments for code clone detection

M White, M Tufano, C Vendome… - Proceedings of the 31st …, 2016 - dl.acm.org
Code clone detection is an important problem for software maintenance and evolution. Many
approaches consider either structure or identifiers, but none of the existing detection …

Vuddy: A scalable approach for vulnerable code clone discovery

S Kim, S Woo, H Lee, H Oh - 2017 IEEE symposium on security …, 2017 - ieeexplore.ieee.org
The ecosystem of open source software (OSS) has been growing considerably in size. In
addition, code clones-code fragments that are copied and pasted within or between software …

Software documentation issues unveiled

E Aghajani, C Nagy, OL Vega-Márquez… - 2019 IEEE/ACM 41st …, 2019 - ieeexplore.ieee.org
(Good) Software documentation provides developers and users with a description of what a
software system does, how it operates, and how it should be used. For example, technical …

Software documentation: the practitioners' perspective

E Aghajani, C Nagy, M Linares-Vásquez… - Proceedings of the acm …, 2020 - dl.acm.org
In theory,(good) documentation is an invaluable asset to any software project, as it helps
stakeholders to use, understand, maintain, and evolve a system. In practice, however …

Software clone detection: A systematic review

D Rattan, R Bhatia, M Singh - Information and Software Technology, 2013 - Elsevier
CONTEXT: Reusing software by means of copy and paste is a frequent activity in software
development. The duplicated code is known as a software clone and the activity is known as …

DéjàVu: a map of code duplicates on GitHub

CV Lopes, P Maj, P Martins, V Saini, D Yang… - Proceedings of the …, 2017 - dl.acm.org
Previous studies have shown that there is a non-trivial amount of duplication in source code.
This paper analyzes a corpus of 4.5 million non-fork projects hosted on GitHub representing …

A systematic review on code clone detection

QU Ain, WH Butt, MW Anwar, F Azam… - IEEE access, 2019 - ieeexplore.ieee.org
Code cloning refers to the duplication of source code. It is the most common way of reusing
source code in software development. If a bug is identified in one segment of code, all the …

Comparison and evaluation of code clone detection techniques and tools: A qualitative approach

CK Roy, JR Cordy, R Koschke - Science of computer programming, 2009 - Elsevier
Over the last decade many techniques and tools for software clone detection have been
proposed. In this paper, we provide a qualitative comparison and evaluation of the current …