RunBugRun--An Executable Dataset for Automated Program Repair

JA Prenner, R Robbes - arXiv preprint arXiv:2304.01102, 2023 - arxiv.org
Recently, we can notice a transition to data-driven techniques in Automated Program Repair
(APR), in particular towards deep neural networks. This entails training on hundreds of …

Research on mining software repositories to facilitate refactoring

AS Nyamawe - Wiley Interdisciplinary Reviews: Data Mining …, 2023 - Wiley Online Library
Software refactoring focuses on improving software quality by applying changes to the
internal structure that do not alter the observable behavior. Determining which refactorings …

Challenges in migrating imperative deep learning programs to graph execution: an empirical study

TC Vélez, R Khatchadourian, M Bagherzadeh… - Proceedings of the 19th …, 2022 - dl.acm.org
Efficiency is essential to support responsiveness w.r.t. ever-growing datasets, especially for
Deep Learning (DL) systems. DL frameworks have traditionally embraced deferred …

Demystifying the Impact of Open-Source Machine Learning Libraries on Software Analytics

Y Zhao, Y Gong, L Gong, S Jiang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Machine learning (ML) classification techniques from various libraries have been widely
introduced into software engineering (SE) to mine instructive insights, which help …

RepoMiner: a language-agnostic Python framework to mine software repositories for defect prediction

SD Palma, D Di Nucci, D Tamburri - arXiv preprint arXiv:2111.11807, 2021 - arxiv.org
Data originating from open-source software projects provide valuable information to
enhance software quality. In the scope of Software Defect Prediction, one of the most …

4.2 Paper 2-On the Impact of Programming Languages on Code Quality: A Reproduction Study

ED Berger, C Hollenbeck, P Maj, O Vitek… - Analyzing Large Code …, 2023 - dspace.cvut.cz
However, large-scale hosting services for code, such as GitHub or SourceForge, offer a
glimpse into the lifecycles of software. Not only do they host the sources for millions of …