Dealing with noise problem in machine learning data-sets: A systematic review

S Gupta, A Gupta - Procedia Computer Science, 2019 - Elsevier
The occurrences of noisy data in data set can significantly impact prediction of any
meaningful information. Many empirical studies have shown that noise in data set …

Application of deep learning in software defect prediction: systematic literature review and meta-analysis

ZM Zain, S Sakri, NHA Ismail - Information and Software Technology, 2023 - Elsevier
Context Despite recent attention given to Software Defect Prediction (SDP), the lack of any
systematic effort to assess existing empirical evidence on the application of Deep Learning …

Classification in the presence of label noise: a survey

B Frénay, M Verleysen - IEEE transactions on neural networks …, 2013 - ieeexplore.ieee.org
Label noise is an important issue in classification, with many potential negative
consequences. For example, the accuracy of predictions may decrease, whereas the …

Predictive maintenance on the machining process and machine tool

A Jimenez-Cortadi, I Irigoien, F Boto, B Sierra… - Applied Sciences, 2019 - mdpi.com
This paper presents the process required to implement a data driven Predictive
Maintenance (PdM) not only in the machine decision making, but also in data acquisition …

Benchmark study of feature selection strategies for multi-omics data

Y Li, U Mansmann, S Du, R Hornung - BMC bioinformatics, 2022 - Springer
Background In the last few years, multi-omics data, that is, datasets containing different types
of high-dimensional molecular variables for the same samples, have become increasingly …

Defending support vector machines against data poisoning attacks

S Weerasinghe, T Alpcan, SM Erfani… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Support Vector Machines (SVMs) are vulnerable to targeted training data manipulations
such as poisoning attacks and label flips. By carefully manipulating a subset of training …

[PDF][PDF] Data cleaning for classification using misclassification analysis.

P Jeatrakul, KW Wong… - J. Adv. Comput …, 2010 - researchportal.murdoch.edu.au
In most classification problems, sometimes in order to achieve better results, data cleaning is
used as a preprocessing technique. The purpose of data cleaning is to remove noise …

Improving the performance of machine learning models for biotechnology: The quest for deus ex machina

F Mey, J Clauwaert, K Van Huffel, W Waegeman… - Biotechnology …, 2021 - Elsevier
Abstract Machine learning is becoming an integral part of the Design-Build-Test-Learn cycle
in biotechnology. Machine learning models learn from collected datasets such as omics data …

UTTAMA: an intrusion detection system based on feature clustering and feature transformation

A Nagaraja, B Uma, R Gunupudi - Foundations of Science, 2020 - Springer
Detecting Intrusions and anomalies is becoming much more challenging with new attacks
pop** out over a period of time. Achieving better accuracies by applying benchmark …

Class noise detection based on software metrics and ROC curves

C Catal, O Alan, K Balkan - Information Sciences, 2011 - Elsevier
Noise detection for software measurement datasets is a topic of growing interest. The
presence of class and attribute noise in software measurement datasets degrades the …