Testing machine learning based systems: a systematic map**
Abstract Context: A Machine Learning based System (MLS) is a software system including
one or more components that learn how to perform a task from a given data set. The …
one or more components that learn how to perform a task from a given data set. The …
Empirical review of java program repair tools: A large-scale experiment on 2,141 bugs and 23,551 repair attempts
In the past decade, research on test-suite-based automatic program repair has grown
significantly. Each year, new approaches and implementations are featured in major …
significantly. Each year, new approaches and implementations are featured in major …
Coconut: combining context-aware neural translation models using ensemble for program repair
Automated generate-and-validate (GV) program repair techniques (APR) typically rely on
hard-coded rules, thus only fixing bugs following specific fix patterns. These rules require a …
hard-coded rules, thus only fixing bugs following specific fix patterns. These rules require a …
Can OpenAI's codex fix bugs? an evaluation on QuixBugs
OpenAI's Codex, a GPT-3 like model trained on a large code corpus, has made headlines in
and outside of academia. Given a short user-provided description, it is capable of …
and outside of academia. Given a short user-provided description, it is capable of …
A survey of learning-based automated program repair
Automated program repair (APR) aims to fix software bugs automatically and plays a crucial
role in software development and maintenance. With the recent advances in deep learning …
role in software development and maintenance. With the recent advances in deep learning …
On the reliability of coverage-based fuzzer benchmarking
Given a program where none of our fuzzers finds any bugs, how do we know which fuzzer is
better? In practice, we often look to code coverage as a proxy measure of fuzzer …
better? In practice, we often look to code coverage as a proxy measure of fuzzer …
Bugsinpy: a database of existing bugs in python programs to enable controlled testing and debugging studies
R Widyasari, SQ Sim, C Lok, H Qi, J Phan… - Proceedings of the 28th …, 2020 - dl.acm.org
The 2019 edition of Stack Overflow developer survey highlights that, for the first time, Python
outperformed Java in terms of popularity. The gap between Python and Java further …
outperformed Java in terms of popularity. The gap between Python and Java further …
Automatic program repair with openai's codex: Evaluating quixbugs
OpenAI's Codex, a GPT-3 like model trained on a large code corpus, has made headlines in
and outside of academia. Given a short user-provided description, it is capable of …
and outside of academia. Given a short user-provided description, it is capable of …
[PDF][PDF] Unifying the perspectives of nlp and software engineering: A survey on language models for code
Z Zhang, C Chen, B Liu, C Liao, Z Gong… - arxiv preprint arxiv …, 2023 - simg.baai.ac.cn
In this work we systematically review the recent advancements in code processing with
language models, covering 50+ models, 30+ evaluation tasks, 170+ datasets, and 700 …
language models, covering 50+ models, 30+ evaluation tasks, 170+ datasets, and 700 …
Towards javascript program repair with generative pre-trained transformer (gpt-2)
The goal of Automated Program Repair (APR) is to find a fix to software bugs, without human
intervention. The so-called Generate and Validate (G&V) approach deemed to be the most …
intervention. The so-called Generate and Validate (G&V) approach deemed to be the most …