Stop oversampling for class imbalance learning: A review

AS Tarawneh, AB Hassanat, GA Altarawneh… - IEEe …, 2022 - ieeexplore.ieee.org
For the last two decades, oversampling has been employed to overcome the challenge of
learning from imbalanced datasets. Many approaches to solving this challenge have been …

The use of generative adversarial networks to alleviate class imbalance in tabular data: a survey

R Sauber-Cole, TM Khoshgoftaar - Journal of Big Data, 2022 - Springer
The existence of class imbalance in a dataset can greatly bias the classifier towards majority
classification. This discrepancy can pose a serious problem for deep learning models, which …

Tabddpm: Modelling tabular data with diffusion models

A Kotelnikov, D Baranchuk… - International …, 2023 - proceedings.mlr.press
Denoising diffusion probabilistic models are becoming the leading generative modeling
paradigm for many important data modalities. Being the most prevalent in the computer …

Deep neural networks and tabular data: A survey

V Borisov, T Leemann, K Seßler, J Haug… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Heterogeneous tabular data are the most commonly used form of data and are essential for
numerous critical and computationally demanding applications. On homogeneous datasets …

Imbalanced data classification: A KNN and generative adversarial networks-based hybrid approach for intrusion detection

H Ding, L Chen, L Dong, Z Fu, X Cui - Future Generation Computer Systems, 2022 - Elsevier
With the continuous emergence of various network attacks, it is becoming more and more
important to ensure the security of the network. Intrusion detection, as one of the important …

Ctab-gan: Effective table data synthesizing

Z Zhao, A Kunar, R Birke… - Asian Conference on …, 2021 - proceedings.mlr.press
While data sharing is crucial for knowledge development, privacy concerns and strict
regulation (eg, European General Data Protection Regulation (GDPR)) unfortunately limit its …

Synthetic attack data generation model applying generative adversarial network for intrusion detection

V Kumar, D Sinha - Computers & Security, 2023 - Elsevier
Detecting a large number of attack classes accurately applying machine learning (ML) and
deep learning (DL) techniques depends on the number of representative samples available …

RGAN-EL: A GAN and ensemble learning-based hybrid approach for imbalanced data classification

H Ding, Y Sun, Z Wang, N Huang, Z Shen… - Information Processing & …, 2023 - Elsevier
Imbalanced sample distribution is usually the main reason for the performance degradation
of machine learning algorithms. Based on this, this study proposes a hybrid framework …

A survey on gan techniques for data augmentation to address the imbalanced data issues in credit card fraud detection

E Strelcenia, S Prakoonwit - Machine Learning and Knowledge Extraction, 2023 - mdpi.com
Data augmentation is an important procedure in deep learning. GAN-based data
augmentation can be utilized in many domains. For instance, in the credit card fraud domain …

Ctab-gan+: Enhancing tabular data synthesis

Z Zhao, A Kunar, R Birke, H Van der Scheer… - Frontiers in big …, 2024 - frontiersin.org
The usage of synthetic data is gaining momentum in part due to the unavailability of original
data due to privacy and legal considerations and in part due to its utility as an augmentation …