A brief survey of text mining: Classification, clustering and extraction techniques

M Allahyari, S Pouriyeh, M Assefi, S Safaei… - arxiv preprint arxiv …, 2017 - arxiv.org
The amount of text that is generated every day is increasing dramatically. This tremendous
volume of mostly unstructured text cannot be simply processed and perceived by computers …

An intelligent system for spam detection and identification of the most relevant features based on evolutionary random weight networks

H Faris, AZ Ala'M, AA Heidari, I Aljarah, M Mafarja… - Information …, 2019 - Elsevier
With the incremental use of emails as an essential and popular communication mean over
the Internet, there comes a serious threat that impacts the Internet and the society. This …

[HTML][HTML] Text categorization with WEKA: A survey

D Merlini, M Rossini - Machine Learning with Applications, 2021 - Elsevier
This work shows the use of WEKA, a tool that implements the most common machine
learning algorithms, to perform a Text Mining analysis on a set of documents. Applying these …

The impact of preprocessing on text classification

AK Uysal, S Gunal - Information processing & management, 2014 - Elsevier
Preprocessing is one of the key components in a typical text classification framework. This
paper aims to extensively examine the impact of preprocessing on text classification in terms …

Text mining of news-headlines for FOREX market prediction: A Multi-layer Dimension Reduction Algorithm with semantics and sentiment

AK Nassirtoussi, S Aghabozorgi, TY Wah… - Expert Systems with …, 2015 - Elsevier
In this paper a novel approach is proposed to predict intraday directional-movements of a
currency-pair in the foreign exchange market based on the text of breaking financial news …

An improved global feature selection scheme for text classification

AK Uysal - Expert systems with Applications, 2016 - Elsevier
Feature selection is known as a good solution to the high dimensionality of the feature space
and mostly preferred feature selection methods for text classification are filter-based ones. In …

A novel probabilistic feature selection method for text classification

AK Uysal, S Gunal - Knowledge-Based Systems, 2012 - Elsevier
High dimensionality of the feature space is one of the most important concerns in text
classification problems due to processing time and accuracy considerations. Selection of …

[HTML][HTML] A comprehensive study of spam detection in e-mails using bio-inspired optimization techniques

J Batra, R Jain, VA Tikkiwal, A Chakraborty - International Journal of …, 2021 - Elsevier
Electronic mail is a medium of communication used frequently for conveying a variety of
information. It has become an integral part of people's lives owing to its ease of access and …

[KIRJA][B] Social data analytics

A Beheshti, S Ghodratnama, M Elahi, H Farhood - 2022 - taylorfrancis.com
This book is an introduction to social data analytics along with its challenges and
opportunities in the age of Big Data and Artificial Intelligence. It focuses primarily on …

Relative discrimination criterion–A novel feature ranking method for text data

A Rehman, K Javed, HA Babri, M Saeed - Expert Systems with Applications, 2015 - Elsevier
High dimensionality of text data hinders the performance of classifiers making it necessary to
apply feature selection for dimensionality reduction. Most of the feature ranking metrics for …