SMOTE for learning from imbalanced data: progress and challenges, marking the 15-year anniversary

A Fernández, S Garcia, F Herrera, NV Chawla - Journal of artificial …, 2018 - jair.org
The Synthetic Minority Oversampling Technique (SMOTE) preprocessing algorithm is
considered" de facto" standard in the framework of learning from imbalanced data. This is …

[HTML][HTML] Learning from imbalanced data: open challenges and future directions

B Krawczyk - Progress in artificial intelligence, 2016 - Springer
Despite more than two decades of continuous development learning from imbalanced data
is still a focus of intense research. Starting as a problem of skewed distributions of binary …

An empirical comparison and evaluation of minority oversampling techniques on a large number of imbalanced datasets

G Kovács - Applied soft computing, 2019 - Elsevier
Learning and mining from imbalanced datasets gained increased interest in recent years.
One simple but efficient way to increase the performance of standard machine learning …

Stop oversampling for class imbalance learning: A review

AS Tarawneh, AB Hassanat, GA Altarawneh… - IEEe …, 2022 - ieeexplore.ieee.org
For the last two decades, oversampling has been employed to overcome the challenge of
learning from imbalanced datasets. Many approaches to solving this challenge have been …

Imbalanced deep learning by minority class incremental rectification

Q Dong, S Gong, X Zhu - IEEE transactions on pattern analysis …, 2018 - ieeexplore.ieee.org
Model learning from class imbalanced training data is a long-standing and significant
challenge for machine learning. In particular, existing deep learning methods consider …

A comprehensive survey on rare event prediction

C Shyalika, R Wickramarachchi, AP Sheth - ACM Computing Surveys, 2024 - dl.acm.org
Rare event prediction involves identifying and forecasting events with a low probability using
machine learning (ML) and data analysis. Due to the imbalanced data distributions, where …

SMOTE for handling imbalanced data problem: A review

GA Pradipta, R Wardoyo, A Musdholifah… - … on informatics and …, 2021 - ieeexplore.ieee.org
Imbalanced class data distribution occurs when the number of examples representing one
class is much lower than others. This conditioning affects the prediction accuracy degraded …

Confusion-matrix-based kernel logistic regression for imbalanced data classification

M Ohsaki, P Wang, K Matsuda… - … on Knowledge and …, 2017 - ieeexplore.ieee.org
There have been many attempts to classify imbalanced data, since this classification is
critical in a wide variety of applications related to the detection of anomalies, failures, and …

A regularized ensemble framework of deep learning for cancer detection from multi-class, imbalanced training data

X Yuan, L **e, M Abouelenien - Pattern Recognition, 2018 - Elsevier
In medical diagnosis, eg bowel cancer detection, a large number of examples of normal
cases exists with a much smaller number of positive cases. Such data imbalance usually …

Fabric defect detection using activation layer embedded convolutional neural network

W Ouyang, B Xu, J Hou, X Yuan - IEEE Access, 2019 - ieeexplore.ieee.org
Loom malfunctions are the main cause of faulty fabric production. A fabric inspection system
is a specialized computer vision system used to detect fabric defects for quality assurance. In …