CatBoost for big data: an interdisciplinary review

JT Hancock, TM Khoshgoftaar - Journal of big data, 2020 - Springer
Abstract Gradient Boosted Decision Trees (GBDT's) are a powerful tool for classification and
regression tasks in Big Data. Researchers should be familiar with the strengths and …

Deep learning based vulnerability detection: Are we there yet?

S Chakraborty, R Krishna, Y Ding… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Automated detection of software vulnerabilities is a fundamental problem in software
security. Existing program analysis techniques either suffer from high false positives or false …

A systematic review on imbalanced data challenges in machine learning: Applications and solutions

H Kaur, HS Pannu, AK Malhi - ACM computing surveys (CSUR), 2019 - dl.acm.org
In machine learning, the data imbalance imposes challenges to perform data analytics in
almost all areas of real-world research. The raw primary data often suffers from the skewed …

[图书][B] Learning from imbalanced data sets

Learning with imbalanced data refers to the scenario in which the amounts of instances that
represent the concepts in a given problem follow a different distribution. The main issue …

A broad review on class imbalance learning techniques

S Rezvani, X Wang - Applied Soft Computing, 2023 - Elsevier
The imbalanced learning issue is related to the performance of learning algorithms in the
presence of asymmetrical class distribution. Due to the complex characteristics of …

SMOTE for learning from imbalanced data: progress and challenges, marking the 15-year anniversary

A Fernández, S Garcia, F Herrera, NV Chawla - Journal of artificial …, 2018 - jair.org
The Synthetic Minority Oversampling Technique (SMOTE) preprocessing algorithm is
considered" de facto" standard in the framework of learning from imbalanced data. This is …

On the class overlap problem in imbalanced data classification

P Vuttipittayamongkol, E Elyan, A Petrovski - Knowledge-based systems, 2021 - Elsevier
Class imbalance is an active research area in the machine learning community. However,
existing and recent literature showed that class overlap had a higher negative impact on the …

FW-SMOTE: A feature-weighted oversampling approach for imbalanced classification

S Maldonado, C Vairetti, A Fernandez, F Herrera - Pattern Recognition, 2022 - Elsevier
Abstract The Synthetic Minority Over-sampling Technique (SMOTE) is a well-known
resampling strategy that has been successfully used for dealing with the class-imbalance …

Class-imbalanced dynamic financial distress prediction based on Adaboost-SVM ensemble combined with SMOTE and time weighting

J Sun, H Li, H Fujita, B Fu, W Ai - Information Fusion, 2020 - Elsevier
This paper focuses on how to effectively construct dynamic financial distress prediction
models based on class-imbalanced data streams. Two class-imbalanced dynamic financial …

Effective data generation for imbalanced learning using conditional generative adversarial networks

G Douzas, F Bacao - Expert Systems with applications, 2018 - Elsevier
Learning from imbalanced datasets is a frequent but challenging task for standard
classification algorithms. Although there are different strategies to address this problem …