A review of the oversampling techniques in class imbalance problem

S Sharma, A Gosain, S Jain - … : Proceedings of ICICC 2021, Volume 1, 2022 - Springer
Class imbalance is often faced by real-world datasets where one class contains a smaller
number of instances than the other one. Even though this has been an area of interest for …

A comprehensive analysis of synthetic minority oversampling technique (SMOTE) for handling class imbalance

D Elreedy, AF Atiya - Information Sciences, 2019 - Elsevier
Imbalanced classification problems are often encountered in many applications. The
challenge is that there is a minority class that has typically very little data and is often the …

SMOTE-ENC: A novel SMOTE-based method to generate synthetic data for nominal and continuous features

M Mukherjee, M Khushi - Applied system innovation, 2021 - mdpi.com
Real-world datasets are heavily skewed where some classes are significantly outnumbered
by the other classes. In these situations, machine learning algorithms fail to achieve …

Multi-class imbalanced big data classification on spark

WC Sleeman IV, B Krawczyk - Knowledge-Based Systems, 2021 - Elsevier
Despite more than two decades of progress, learning from imbalanced data is still
considered as one of the contemporary challenges in machine learning. This has been …

A novel data augmentation approach to fault diagnosis with class-imbalance problem

J Tian, Y Jiang, J Zhang, H Luo, S Yin - Reliability Engineering & System …, 2024 - Elsevier
Data-driven fault diagnosis techniques are frequently applied to ensure the reliability and
safety of industrial systems. However, as a common challenge, the class-imbalance problem …

Using variational auto encoding in credit card fraud detection

H Tingfei, C Guangquan, H Kuihua - IEEE Access, 2020 - ieeexplore.ieee.org
Machine learning approaches are widely used to analyze and detect the increasingly
serious problem of credit card fraud. However, typical credit card datasets present …

Radial-based oversampling for multiclass imbalanced data classification

B Krawczyk, M Koziarski… - IEEE transactions on …, 2019 - ieeexplore.ieee.org
Learning from imbalanced data is among the most popular topics in the contemporary
machine learning. However, the vast majority of attention in this field is given to binary …

Synthetic oversampling with mahalanobis distance and local information for highly imbalanced class-overlapped data

Y Yan, L Zheng, S Han, C Yu, P Zhou - Expert Systems with Applications, 2025 - Elsevier
Minority oversampling is currently one of the most popular and effective methods for
handling imbalanced data. However, oversampling that relies on the observations of the …

Local distribution-based adaptive minority oversampling for imbalanced data classification

X Wang, J Xu, T Zeng, L **g - Neurocomputing, 2021 - Elsevier
Imbalanced data classification, as a challenging task, has drawn a significant interest in
numerous scientific areas. One popular strategy to balance the instance quantities between …

An empirical study toward dealing with noise and class imbalance issues in software defect prediction

SK Pandey, AK Tripathi - Soft Computing, 2021 - Springer
The quality of the defect datasets is a critical issue in the domain of software defect
prediction (SDP). These datasets are obtained through the mining of software repositories …