SMOTE for learning from imbalanced data: progress and challenges, marking the 15-year anniversary
The Synthetic Minority Oversampling Technique (SMOTE) preprocessing algorithm is
considered" de facto" standard in the framework of learning from imbalanced data. This is …
considered" de facto" standard in the framework of learning from imbalanced data. This is …
A survey of predictive modeling on imbalanced domains
Many real-world data-mining applications involve obtaining predictive models using
datasets with strongly imbalanced distributions of the target variable. Frequently, the least …
datasets with strongly imbalanced distributions of the target variable. Frequently, the least …
Predictive performance of presence‐only species distribution models: a benchmark study with reproducible code
Species distribution modeling (SDM) is widely used in ecology and conservation. Currently,
the most available data for SDM are species presence‐only records (available through …
the most available data for SDM are species presence‐only records (available through …
The class imbalance problem in deep learning
Deep learning has recently unleashed the ability for Machine learning (ML) to make
unparalleled strides. It did so by confronting and successfully addressing, at least to a …
unparalleled strides. It did so by confronting and successfully addressing, at least to a …
On the class overlap problem in imbalanced data classification
Class imbalance is an active research area in the machine learning community. However,
existing and recent literature showed that class overlap had a higher negative impact on the …
existing and recent literature showed that class overlap had a higher negative impact on the …
A hybrid method with dynamic weighted entropy for handling the problem of class imbalance with overlap in credit card fraud detection
Z Li, M Huang, G Liu, C Jiang - Expert Systems with Applications, 2021 - Elsevier
Class imbalance with overlap is a very challenging problem in electronic fraud transaction
detection. Fraudsters have racked their brains to make a fraud transaction as similar as a …
detection. Fraudsters have racked their brains to make a fraud transaction as similar as a …
GHOST: adjusting the decision threshold to handle imbalanced data in machine learning
Machine learning classifiers trained on class imbalanced data are prone to overpredict the
majority class. This leads to a larger misclassification rate for the minority class, which in …
majority class. This leads to a larger misclassification rate for the minority class, which in …
A systematic review on imbalanced learning methods in intelligent fault diagnosis
The theoretical developments of data-driven fault diagnosis methods have yielded fruitful
achievements and significantly benefited industry practices. However, most methods are …
achievements and significantly benefited industry practices. However, most methods are …
Modelling species presence‐only data with random forests
The random forest (RF) algorithm is an ensemble of classification or regression trees and is
widely used, including for species distribution modelling (SDM). Many researchers use …
widely used, including for species distribution modelling (SDM). Many researchers use …
On the joint-effect of class imbalance and overlap: a critical review
Current research on imbalanced data recognises that class imbalance is aggravated by
other data intrinsic characteristics, among which class overlap stands out as one of the most …
other data intrinsic characteristics, among which class overlap stands out as one of the most …