CatBoost for big data: an interdisciplinary review
Abstract Gradient Boosted Decision Trees (GBDT's) are a powerful tool for classification and
regression tasks in Big Data. Researchers should be familiar with the strengths and …
regression tasks in Big Data. Researchers should be familiar with the strengths and …
Deep learning based vulnerability detection: Are we there yet?
Automated detection of software vulnerabilities is a fundamental problem in software
security. Existing program analysis techniques either suffer from high false positives or false …
security. Existing program analysis techniques either suffer from high false positives or false …
A systematic review on imbalanced data challenges in machine learning: Applications and solutions
In machine learning, the data imbalance imposes challenges to perform data analytics in
almost all areas of real-world research. The raw primary data often suffers from the skewed …
almost all areas of real-world research. The raw primary data often suffers from the skewed …
[图书][B] Learning from imbalanced data sets
Learning with imbalanced data refers to the scenario in which the amounts of instances that
represent the concepts in a given problem follow a different distribution. The main issue …
represent the concepts in a given problem follow a different distribution. The main issue …
A broad review on class imbalance learning techniques
S Rezvani, X Wang - Applied Soft Computing, 2023 - Elsevier
The imbalanced learning issue is related to the performance of learning algorithms in the
presence of asymmetrical class distribution. Due to the complex characteristics of …
presence of asymmetrical class distribution. Due to the complex characteristics of …
SMOTE for learning from imbalanced data: progress and challenges, marking the 15-year anniversary
The Synthetic Minority Oversampling Technique (SMOTE) preprocessing algorithm is
considered" de facto" standard in the framework of learning from imbalanced data. This is …
considered" de facto" standard in the framework of learning from imbalanced data. This is …
On the class overlap problem in imbalanced data classification
Class imbalance is an active research area in the machine learning community. However,
existing and recent literature showed that class overlap had a higher negative impact on the …
existing and recent literature showed that class overlap had a higher negative impact on the …
FW-SMOTE: A feature-weighted oversampling approach for imbalanced classification
Abstract The Synthetic Minority Over-sampling Technique (SMOTE) is a well-known
resampling strategy that has been successfully used for dealing with the class-imbalance …
resampling strategy that has been successfully used for dealing with the class-imbalance …
Class-imbalanced dynamic financial distress prediction based on Adaboost-SVM ensemble combined with SMOTE and time weighting
This paper focuses on how to effectively construct dynamic financial distress prediction
models based on class-imbalanced data streams. Two class-imbalanced dynamic financial …
models based on class-imbalanced data streams. Two class-imbalanced dynamic financial …
Effective data generation for imbalanced learning using conditional generative adversarial networks
Learning from imbalanced datasets is a frequent but challenging task for standard
classification algorithms. Although there are different strategies to address this problem …
classification algorithms. Although there are different strategies to address this problem …