Survey on synthetic data generation, evaluation methods and GANs

A Figueira, B Vaz - Mathematics, 2022 - mdpi.com
Synthetic data consists of artificially generated data. When data are scarce, or of poor
quality, synthetic data can be used, for example, to improve the performance of machine …

One-class support vector classifiers: A survey

S Alam, SK Sonbhadra, S Agarwal… - Knowledge-Based …, 2020 - Elsevier
Over the past two decades, one-class classification (OCC) becomes very popular due to its
diversified applicability in data mining and pattern recognition problems. Concerning to …

A benchmark of machine learning approaches for credit score prediction

V Moscato, A Picariello, G Sperlí - Expert Systems with Applications, 2021 - Elsevier
Credit risk assessment plays a key role for correctly supporting financial institutes in defining
their bank policies and commercial strategies. Over the last decade, the emerging of social …

On the class overlap problem in imbalanced data classification

P Vuttipittayamongkol, E Elyan, A Petrovski - Knowledge-based systems, 2021 - Elsevier
Class imbalance is an active research area in the machine learning community. However,
existing and recent literature showed that class overlap had a higher negative impact on the …

Intelligent fault diagnosis of rolling bearings based on normalized CNN considering data imbalance and variable working conditions

B Zhao, X Zhang, H Li, Z Yang - Knowledge-Based Systems, 2020 - Elsevier
Intelligent fault detection and diagnosis, as an important approach, play a crucial role in
ensuring the stable, reliable and safe operation of rolling bearings, which is one of the most …

Class-imbalanced dynamic financial distress prediction based on Adaboost-SVM ensemble combined with SMOTE and time weighting

J Sun, H Li, H Fujita, B Fu, W Ai - Information Fusion, 2020 - Elsevier
This paper focuses on how to effectively construct dynamic financial distress prediction
models based on class-imbalanced data streams. Two class-imbalanced dynamic financial …

A new deep learning ensemble credit risk evaluation model with an improved synthetic minority oversampling technique

F Shen, X Zhao, G Kou, FE Alsaadi - Applied Soft Computing, 2021 - Elsevier
In recent years, research has found that in many credit risk evaluation domains, deep
learning is superior to traditional machine learning methods and classifier ensembles …

[HTML][HTML] A hybrid sampling algorithm combining M-SMOTE and ENN based on Random forest for medical imbalanced data

Z Xu, D Shen, T Nie, Y Kou - Journal of Biomedical Informatics, 2020 - Elsevier
The problem of imbalanced data classification often exists in medical diagnosis. Traditional
classification algorithms usually assume that the number of samples in each class is similar …

A cluster-based oversampling algorithm combining SMOTE and k-means for imbalanced medical data

Z Xu, D Shen, T Nie, Y Kou, N Yin, X Han - Information Sciences, 2021 - Elsevier
The algorithm of C4. 5 decision tree has the advantages of high classification accuracy, fast
calculation speed and comprehensible classification rules, so it is widely used for medical …

I-SiamIDS: an improved Siam-IDS for handling class imbalance in network-based intrusion detection systems

P Bedi, N Gupta, V **dal - Applied Intelligence, 2021 - Springer
Abstract Network-based Intrusion Detection Systems (NIDSs) identify malicious activities by
analyzing network traffic. NIDSs are trained with the samples of benign and intrusive …