A comprehensive analysis of synthetic minority oversampling technique (SMOTE) for handling class imbalance

D Elreedy, AF Atiya - Information Sciences, 2019 - Elsevier
Imbalanced classification problems are often encountered in many applications. The
challenge is that there is a minority class that has typically very little data and is often the …

SMOTE for high-dimensional class-imbalanced data

R Blagus, L Lusa - BMC bioinformatics, 2013 - Springer
Background Classification using class-imbalanced data is biased in favor of the majority
class. The bias is even larger for high-dimensional data, where the number of variables …

Enhancing machine learning-based sentiment analysis through feature extraction techniques

N A. Semary, W Ahmed, K Amin, P Pławiak… - Plos one, 2024 - journals.plos.org
A crucial part of sentiment classification is featuring extraction because it involves extracting
valuable information from text data, which affects the model's performance. The goal of this …

HPSLPred: an ensemble multi‐label classifier for human protein subcellular location prediction with imbalanced source

S Wan, Y Duan, Q Zou - Proteomics, 2017 - Wiley Online Library
Predicting the subcellular localization of proteins is an important and challenging problem.
Traditional experimental approaches are often expensive and time‐consuming …

Unboxing industry-standard AI models for male fertility prediction with SHAP

D GhoshRoy, PA Alvi, KC Santosh - Healthcare, 2023 - mdpi.com
Infertility is a social stigma for individuals, and male factors cause approximately 30% of
infertility. Despite this, male infertility is underrecognized and underrepresented as a …

[HTML][HTML] An imbalanced fault diagnosis method based on TFFO and CNN for rotating machinery

L Zhang, Y Liu, J Zhou, M Luo, S Pu, X Yang - Sensors, 2022 - mdpi.com
Deep learning-based fault diagnosis usually requires a rich supply of data, but fault samples
are scarce in practice, posing a considerable challenge for existing diagnosis approaches to …

Seabed modelling by means of airborne laser bathymetry data and imbalanced learning for offshore map**

T Kogut, A Tomczak, A Słowik, T Oberski - Sensors, 2022 - mdpi.com
An important problem associated with the aerial map** of the seabed is the precise
classification of point clouds characterizing the water surface, bottom, and bottom objects …

[HTML][HTML] A low-dose CT-based radiomic model to improve characterization and screening recall intervals of indeterminate prevalent pulmonary nodules

L Rundo, RE Ledda, C di Noia, E Sala, G Mauri… - Diagnostics, 2021 - mdpi.com
Lung cancer (LC) is currently one of the main causes of cancer-related deaths worldwide.
Low-dose computed tomography (LDCT) of the chest has been proven effective in …

Unbalanced Fault Diagnosis Based on an Invariant Temporal‐Spatial Attention Fusion Network

J Liu, H Yang, J He, Z Sheng… - Computational …, 2022 - Wiley Online Library
The health status of mechanical bearings concerns the safety of equipment usage.
Therefore, it is of crucial importance to monitor mechanical bearings. Currently, deep …

Mining chemical activity status from high-throughput screening assays

O Soufan, W Ba-Alawi, M Afeef, M Essack, V Rodionov… - PloS one, 2015 - journals.plos.org
High-throughput screening (HTS) experiments provide a valuable resource that reports
biological activity of numerous chemical compounds relative to their molecular targets …