[HTML][HTML] A review of ensemble learning and data augmentation models for class imbalanced problems: Combination, implementation and evaluation

AA Khan, O Chaudhari, R Chandra - Expert Systems with Applications, 2024 - Elsevier
Class imbalance (CI) in classification problems arises when the number of observations
belonging to one class is lower than the other. Ensemble learning combines multiple models …

AWGAN: An adaptive weighting GAN approach for oversampling imbalanced datasets

S Guan, X Zhao, Y Xue, H Pan - Information Sciences, 2024 - Elsevier
Oversampling is a widely employed technique for addressing imbalanced datasets, facing
challenges like class overlaps, intra-class imbalance, and noise. In this paper, we introduce …

Class-overlap detection based on heterogeneous clustering ensemble for multi-class imbalance problem

Q Dai, L Wang, K Xu, T Du, L Chen - Expert Systems with Applications, 2024 - Elsevier
The class imbalance problem is one of the main challenges that hinders classifiers from
identifying unknown instances. When class distribution imbalance and class overlap coexist …

SWSEL: Sliding Window-based Selective Ensemble Learning for class-imbalance problems

Q Dai, J Liu, JP Yang - Engineering Applications of Artificial Intelligence, 2023 - Elsevier
For class-imbalance problems, traditional supervised learning algorithms tend to favor
majority instances (also called negative instances). Therefore, it is difficult for them to …

Lightweight intrusion detection model based on CNN and knowledge distillation

LH Wang, Q Dai, T Du, L Chen - Applied Soft Computing, 2024 - Elsevier
The problem of network attacks is a primary focus in the domain of intrusion detection.
Models face significant challenges in recognizing intrusion behaviors, particularly when …

A diversity and reliability-enhanced synthetic minority oversampling technique for multi-label learning

Y Gong, Q Wu, M Zhou, C Chen - Information Sciences, 2025 - Elsevier
The class imbalance issue is generally intrinsic in multi-label datasets due to the fact that
they have a large number of labels and each sample is associated with only a few of them …

A software defect prediction method based on learnable three-line hybrid feature fusion

Y Tang, Q Dai, Y Du, L Chen, X Niu - Expert Systems with Applications, 2024 - Elsevier
Software defect prediction (SDP) plays a crucial role in ensuring the security and quality of
software systems. However, it faces challenges posed by high-dimensional features present …

Dynamic classification ensembles for handling imbalanced multiclass drifted data streams

AH Madkour, HM Abdelkader, AM Mohammed - Information Sciences, 2024 - Elsevier
Abstract Machine learning models often encounter significant difficulties when dealing with
multiclass imbalanced data streams in nonstationary environments. These challenges can …

Undersampling based on generalized learning vector quantization and natural nearest neighbors for imbalanced data

LH Wang, Q Dai, JY Wang, T Du, L Chen - International Journal of Machine …, 2024 - Springer
Imbalanced datasets can adversely affect classifier performance. Conventional
undersampling approaches may lead to the loss of essential information, while …

Instance gravity oversampling method for software defect prediction

Y Tang, Y Zhou, C Yang, Y Du, M Yang - Information and Software …, 2025 - Elsevier
Context In the software defect datasets, the number of defective instances is significantly
lower than that of non-defective instances. This imbalance adversely impacts the predictive …