[HTML][HTML] Challenges and opportunities of generative models on tabular data

AX Wang, SS Chukova, CR Simpson, BP Nguyen - Applied Soft Computing, 2024 - Elsevier
Tabular data, organized like tables with rows and columns, is widely used. Existing models
for tabular data synthesis often face limitations related to data size or complexity. In contrast …

Review of resampling techniques for the treatment of imbalanced industrial data classification in equipment condition monitoring

Y Yuan, J Wei, H Huang, W Jiao, J Wang… - … Applications of Artificial …, 2023 - Elsevier
In an actual industrial scenario, machines typically operate normally for the majority of the
time, with malfunctions occurring only occasionally. As a result, there is very little recorded …

Class-overlap undersampling based on Schur decomposition for Class-imbalance problems

Q Dai, J Liu, Y Shi - Expert Systems with Applications, 2023 - Elsevier
The class-imbalance problem is an important area that plagues machine learning and data
mining researchers. It is ubiquitous in all areas of the real world. At present, many methods …

Software defect prediction ensemble learning algorithm based on adaptive variable sparrow search algorithm

Y Tang, Q Dai, M Yang, T Du, L Chen - International Journal of Machine …, 2023 - Springer
Software defect prediction has caused widespread concern among software engineering
researchers, which aims to erect a software defect prediction model according to historical …

Imbalanced customer churn classification using a new multi-strategy collaborative processing method

C Rao, Y Xu, X **ao, F Hu, M Goh - Expert Systems with Applications, 2024 - Elsevier
The rapid advancement of big data and artificial intelligence heralds a dual-edged era of
opportunities and challenges for the banking sector. Indeed, enhancing a model's capability …

MGSFformer: A multi-granularity spatiotemporal fusion transformer for air quality prediction

C Yu, F Wang, Y Wang, Z Shao, T Sun, D Yao, Y Xu - Information Fusion, 2025 - Elsevier
Air quality spatiotemporal prediction can provide technical support for environmental
governance and sustainable city development. As a classic multi-source spatiotemporal …

Two-step ensemble under-sampling algorithm for massive imbalanced data classification

L Bai, T Ju, H Wang, M Lei, X Pan - Information Sciences, 2024 - Elsevier
Imbalanced data classification is a challenging problem in the field of machine learning.
Class imbalance, class overlap, and large data volume significantly affect classification …

Class-overlap detection based on heterogeneous clustering ensemble for multi-class imbalance problem

Q Dai, L Wang, K Xu, T Du, L Chen - Expert Systems with Applications, 2024 - Elsevier
The class imbalance problem is one of the main challenges that hinders classifiers from
identifying unknown instances. When class distribution imbalance and class overlap coexist …

SWSEL: Sliding Window-based Selective Ensemble Learning for class-imbalance problems

Q Dai, J Liu, JP Yang - Engineering Applications of Artificial Intelligence, 2023 - Elsevier
For class-imbalance problems, traditional supervised learning algorithms tend to favor
majority instances (also called negative instances). Therefore, it is difficult for them to …

Hybrid resampling and weighted majority voting for multi-class anomaly detection on imbalanced malware and network traffic data

L Xue, T Zhu - Engineering Applications of Artificial Intelligence, 2024 - Elsevier
In a large skewed dataset, the data imbalance is severe and the classifier's accuracy is
biased towards the majority class. Insufficient data makes it challenging for the classifier to …