A survey of machine learning for big data processing

J Qiu, Q Wu, G Ding, Y Xu, S Feng - EURASIP Journal on Advances in …, 2016 - Springer
There is no doubt that big data are now rapidly expanding in all science and engineering
domains. While the potential of these massive data is undoubtedly significant, fully making …

On the joint-effect of class imbalance and overlap: a critical review

MS Santos, PH Abreu, N Japkowicz… - Artificial Intelligence …, 2022 - Springer
Current research on imbalanced data recognises that class imbalance is aggravated by
other data intrinsic characteristics, among which class overlap stands out as one of the most …

How complex is your classification problem? a survey on measuring classification complexity

AC Lorena, LPF Garcia, J Lehmann… - ACM Computing …, 2019 - dl.acm.org
Characteristics extracted from the training datasets of classification problems have proven to
be effective predictors in a number of meta-analyses. Among them, measures of …

Dynamic selection of normalization techniques using data complexity measures

S Jain, S Shukla, R Wadhvani - Expert Systems with Applications, 2018 - Elsevier
Data preprocessing is an important step for designing classification model. Normalization is
one of the preprocessing techniques used to handle the out-of-bounds attributes. This work …

A unifying view of class overlap and imbalance: Key concepts, multi-view panorama, and open avenues for research

MS Santos, PH Abreu, N Japkowicz, A Fernández… - Information …, 2023 - Elsevier
The combination of class imbalance and overlap is currently one of the most challenging
issues in machine learning. While seminal work focused on establishing class overlap as a …

A literature survey and empirical study of meta-learning for classifier selection

I Khan, X Zhang, M Rehman, R Ali - IEEE Access, 2020 - ieeexplore.ieee.org
Classification is the key and most widely studied paradigm in machine learning community.
The selection of appropriate classification algorithm for a particular problem is a challenging …

Few-shot aspect category sentiment analysis via meta-learning

B Liang, X Li, L Gui, Y Fu, Y He, M Yang… - ACM Transactions on …, 2023 - dl.acm.org
Existing aspect-based/category sentiment analysis methods have shown great success in
detecting sentiment polarity toward a given aspect in a sentence with supervised learning …

Evidential instance selection for K-nearest neighbor classification of big data

C Gong, Z Su, P Wang, Q Wang, Y You - International Journal of …, 2021 - Elsevier
Many instance selection algorithms have been introduced to reduce the high storage
requirements and computation complexity of K-nearest neighbor (K-NN) classification rules …

Relating instance hardness to classification performance in a dataset: a visual approach

PYA Paiva, CC Moreno, K Smith-Miles, MG Valeriano… - Machine Learning, 2022 - Springer
Abstract Machine Learning studies often involve a series of computational experiments in
which the predictive performance of multiple models are compared across one or more …

Data complexity meta-features for regression problems

AC Lorena, AI Maciel, PBC de Miranda, IG Costa… - Machine Learning, 2018 - Springer
In meta-learning, classification problems can be described by a variety of features, including
complexity measures. These measures allow capturing the complexity of the frontier that …