Dealing with noise problem in machine learning data-sets: A systematic review
The occurrences of noisy data in data set can significantly impact prediction of any
meaningful information. Many empirical studies have shown that noise in data set …
meaningful information. Many empirical studies have shown that noise in data set …
Application of deep learning in software defect prediction: systematic literature review and meta-analysis
Context Despite recent attention given to Software Defect Prediction (SDP), the lack of any
systematic effort to assess existing empirical evidence on the application of Deep Learning …
systematic effort to assess existing empirical evidence on the application of Deep Learning …
Classification in the presence of label noise: a survey
Label noise is an important issue in classification, with many potential negative
consequences. For example, the accuracy of predictions may decrease, whereas the …
consequences. For example, the accuracy of predictions may decrease, whereas the …
Predictive maintenance on the machining process and machine tool
This paper presents the process required to implement a data driven Predictive
Maintenance (PdM) not only in the machine decision making, but also in data acquisition …
Maintenance (PdM) not only in the machine decision making, but also in data acquisition …
Benchmark study of feature selection strategies for multi-omics data
Y Li, U Mansmann, S Du, R Hornung - BMC bioinformatics, 2022 - Springer
Background In the last few years, multi-omics data, that is, datasets containing different types
of high-dimensional molecular variables for the same samples, have become increasingly …
of high-dimensional molecular variables for the same samples, have become increasingly …
Defending support vector machines against data poisoning attacks
Support Vector Machines (SVMs) are vulnerable to targeted training data manipulations
such as poisoning attacks and label flips. By carefully manipulating a subset of training …
such as poisoning attacks and label flips. By carefully manipulating a subset of training …
[PDF][PDF] Data cleaning for classification using misclassification analysis.
P Jeatrakul, KW Wong… - J. Adv. Comput …, 2010 - researchportal.murdoch.edu.au
In most classification problems, sometimes in order to achieve better results, data cleaning is
used as a preprocessing technique. The purpose of data cleaning is to remove noise …
used as a preprocessing technique. The purpose of data cleaning is to remove noise …
Improving the performance of machine learning models for biotechnology: The quest for deus ex machina
Abstract Machine learning is becoming an integral part of the Design-Build-Test-Learn cycle
in biotechnology. Machine learning models learn from collected datasets such as omics data …
in biotechnology. Machine learning models learn from collected datasets such as omics data …
UTTAMA: an intrusion detection system based on feature clustering and feature transformation
Detecting Intrusions and anomalies is becoming much more challenging with new attacks
pop** out over a period of time. Achieving better accuracies by applying benchmark …
pop** out over a period of time. Achieving better accuracies by applying benchmark …
Class noise detection based on software metrics and ROC curves
C Catal, O Alan, K Balkan - Information Sciences, 2011 - Elsevier
Noise detection for software measurement datasets is a topic of growing interest. The
presence of class and attribute noise in software measurement datasets degrades the …
presence of class and attribute noise in software measurement datasets degrades the …