Online learning: A comprehensive survey

SCH Hoi, D Sahoo, J Lu, P Zhao - Neurocomputing, 2021 - Elsevier
Online learning represents a family of machine learning methods, where a learner attempts
to tackle some predictive (or any type of decision-making) task by learning from a sequence …

Feature selection: A data perspective

J Li, K Cheng, S Wang, F Morstatter… - ACM computing …, 2017 - dl.acm.org
Feature selection, as a data preprocessing strategy, has been proven to be effective and
efficient in preparing data (especially high-dimensional data) for various data-mining and …

A survey of machine learning for big data processing

J Qiu, Q Wu, G Ding, Y Xu, S Feng - EURASIP Journal on Advances in …, 2016 - Springer
There is no doubt that big data are now rapidly expanding in all science and engineering
domains. While the potential of these massive data is undoubtedly significant, fully making …

Big data preprocessing: methods and prospects

S García, S Ramírez-Gallego, J Luengo, JM Benítez… - Big data analytics, 2016 - Springer
The massive growth in the scale of data observed in recent years is a key
factor of the Big Data scenario. Big Data can be defined as high volume, velocity and variety …

A survey on data preprocessing for data stream mining: Current status and future directions

S Ramírez-Gallego, B Krawczyk, S García, M Woźniak… - Neurocomputing, 2017 - Elsevier
Data preprocessing and reduction have become essential techniques in current knowledge
discovery scenarios, dominated by increasingly large datasets. These methods aim at …

Feature selection for text classification: A review

X Deng, Y Li, J Weng, J Zhang - Multimedia Tools and Applications, 2019 - Springer
Big multimedia data is heterogeneous in essence, that is, the data may be a mixture of
video, audio, text, and images. This is due to the prevalence of novel applications in recent …

Malicious URL detection using machine learning: A survey

D Sahoo, C Liu, SCH Hoi - arXiv preprint arXiv:1701.07179, 2017 - arxiv.org
Malicious URL, aka malicious website, is a common and serious threat to cybersecurity.
Malicious URLs host unsolicited content (spam, phishing, drive-by exploits, etc.) and lure …

Feature selection for classification: A review

J Tang, S Alelyani, H Liu - Data classification: Algorithms and …, 2014 - math.chalmers.se
Nowadays, the growth of the high-throughput technologies has resulted in exponential
growth in the harvested data with respect to both dimensionality and sample size. The trend …

Recent advances in feature selection and its applications

Y Li, T Li, H Liu - Knowledge and Information Systems, 2017 - Springer
Feature selection is one of the key problems for machine learning and data mining. In this
review paper, a brief historical background of the field is given, followed by a selection of …

Feature selection for high-dimensional data

V Bolón-Canedo, N Sánchez-Maroño… - Progress in Artificial …, 2016 - Springer
This paper offers a comprehensive approach to feature selection in the scope of
classification problems, explaining the foundations, real application problems and the …