A survey of predictive modeling on imbalanced domains

P Branco, L Torgo, RP Ribeiro - ACM computing surveys (CSUR), 2016 - dl.acm.org
Many real-world data-mining applications involve obtaining predictive models using
datasets with strongly imbalanced distributions of the target variable. Frequently, the least …

Concrete 3D printing: Process parameters for process control, monitoring and diagnosis in automation and construction

TKN Quah, YWD Tay, JH Lim, MJ Tan, TN Wong… - Mathematics, 2023 - mdpi.com
In Singapore, there is an increasing need for independence from manpower within the
Building and Construction (B&C) Industry. Prefabricated Prefinished Volumetric Construction …

The impact of class imbalance in classification performance metrics based on the binary confusion matrix

A Luque, A Carrasco, A Martín, A de Las Heras - Pattern Recognition, 2019 - Elsevier
A major issue in the classification of class imbalanced datasets involves the determination of
the most suitable performance metrics to be used. In previous work using several examples …

Optimal classifier for imbalanced data using Matthews Correlation Coefficient metric

S Boughorbel, F Jarray, M El-Anbari - PloS one, 2017 - journals.plos.org
Data imbalance is frequently encountered in biomedical applications. Resampling
techniques can be used in binary classification to tackle this issue. However such solutions …

GAN augmentation to deal with imbalance in imaging-based intrusion detection

G Andresini, A Appice, L De Rose, D Malerba - Future Generation …, 2021 - Elsevier
Nowadays attacks on computer networks continue to advance at a rate outpacing cyber
defenders' ability to write new attack signatures. This paper illustrates a deep learning …

SMOTE for high-dimensional class-imbalanced data

R Blagus, L Lusa - BMC bioinformatics, 2013 - Springer
Background Classification using class-imbalanced data is biased in favor of the majority
class. The bias is even larger for high-dimensional data, where the number of variables …

A novel SMOTE-based resampling technique trough noise detection and the boosting procedure

F Sağlam, MA Cengiz - Expert Systems with Applications, 2022 - Elsevier
Most of the classification methods assume that the numbers of class observations are
balanced. In such cases, models are predicted by giving biased weight to the class with …

Strength of stacking technique of ensemble learning in rockburst prediction with imbalanced data: Comparison of eight single and ensemble models

X Yin, Q Liu, Y Pan, X Huang, J Wu, X Wang - Natural Resources …, 2021 - Springer
Rockburst is a common dynamic geological hazard, severely restricting the development
and utilization of underground space and resources. As the depth of excavation and mining …

On the effectiveness of preprocessing methods when dealing with different levels of class imbalance

V García, JS Sánchez, RA Mollineda - Knowledge-Based Systems, 2012 - Elsevier
The present paper investigates the influence of both the imbalance ratio and the classifier on
the performance of several resampling strategies to deal with imbalanced data sets. The …

Measuring and comparing the accuracy of species distribution models with presence–absence data

C Liu, M White, G Newell - Ecography, 2011 - Wiley Online Library
Species distribution models have been widely used to predict species distributions for
various purposes, including conservation planning, and climate change impact assessment …