On the value of oversampling for deep learning in software defect prediction
One truism of deep learning is that the automatic feature engineering (seen in the first layers
of those networks) excuses data scientists from performing tedious manual feature …
of those networks) excuses data scientists from performing tedious manual feature …
Evaluating classifiers in SE research: the ECSER pipeline and two replication studies
Context Automated classifiers, often based on machine learning (ML), are increasingly used
in software engineering (SE) for labelling previously unseen SE data. Researchers have …
in software engineering (SE) for labelling previously unseen SE data. Researchers have …
Comparative study of random search hyper-parameter tuning for software effort estimation
Empirical studies on software effort estimation have employed hyper-parameter tuning
algorithms to improve model accuracy and stability. While these tuners can improve model …
algorithms to improve model accuracy and stability. While these tuners can improve model …
Parameter tuning for software fault prediction with different variants of differential evolution
The cost of software testing could be reduced if faulty entities were identified prior to the
testing phase, which is possible with software fault prediction (SFP). In most SFP models …
testing phase, which is possible with software fault prediction (SFP). In most SFP models …
How to find actionable static analysis warnings: A case study with FindBugs
Automatically generated static code warnings suffer from a large number of false alarms.
Hence, developers only take action on a small percent of those warnings. To better predict …
Hence, developers only take action on a small percent of those warnings. To better predict …
How to improve deep learning for software analytics: (a case study with code smell detection)
To reduce technical debt and make code more maintainable, it is important to be able to
warn programmers about code smells. State-of-the-art code small detectors use deep …
warn programmers about code smells. State-of-the-art code small detectors use deep …
Enhanced Machine Learning-Based Code Smell Detection Through Hyper-Parameter Optimization
To preserve software quality and maintainability, machine learning-based code smell
detection has been proposed, and the results are promising. This research proposes an …
detection has been proposed, and the results are promising. This research proposes an …
Data quality antipatterns for software analytics
Background: Data quality is vital in software analytics, particularly for machine learning (ML)
applications like software defect prediction (SDP). Despite the widespread use of ML in …
applications like software defect prediction (SDP). Despite the widespread use of ML in …
Demystifying the Impact of Open-Source Machine Learning Libraries on Software Analytics
Machine learning (ML) classification techniques from various libraries have been widely
introduced into software engineering (SE) to mine instructive insights, which help …
introduced into software engineering (SE) to mine instructive insights, which help …
Learning from Very Little Data: On the Value of Landscape Analysis for Predicting Software Project Health
When data is scarce, software analytics can make many mistakes. For example, consider
learning predictors for open source project health (eg, the number of closed pull requests in …
learning predictors for open source project health (eg, the number of closed pull requests in …