A brief survey of text mining: Classification, clustering and extraction techniques
The amount of text that is generated every day is increasing dramatically. This tremendous
volume of mostly unstructured text cannot be simply processed and perceived by computers …
volume of mostly unstructured text cannot be simply processed and perceived by computers …
An intelligent system for spam detection and identification of the most relevant features based on evolutionary random weight networks
With the incremental use of emails as an essential and popular communication mean over
the Internet, there comes a serious threat that impacts the Internet and the society. This …
the Internet, there comes a serious threat that impacts the Internet and the society. This …
[HTML][HTML] Text categorization with WEKA: A survey
This work shows the use of WEKA, a tool that implements the most common machine
learning algorithms, to perform a Text Mining analysis on a set of documents. Applying these …
learning algorithms, to perform a Text Mining analysis on a set of documents. Applying these …
The impact of preprocessing on text classification
Preprocessing is one of the key components in a typical text classification framework. This
paper aims to extensively examine the impact of preprocessing on text classification in terms …
paper aims to extensively examine the impact of preprocessing on text classification in terms …
Text mining of news-headlines for FOREX market prediction: A Multi-layer Dimension Reduction Algorithm with semantics and sentiment
In this paper a novel approach is proposed to predict intraday directional-movements of a
currency-pair in the foreign exchange market based on the text of breaking financial news …
currency-pair in the foreign exchange market based on the text of breaking financial news …
An improved global feature selection scheme for text classification
AK Uysal - Expert systems with Applications, 2016 - Elsevier
Feature selection is known as a good solution to the high dimensionality of the feature space
and mostly preferred feature selection methods for text classification are filter-based ones. In …
and mostly preferred feature selection methods for text classification are filter-based ones. In …
A novel probabilistic feature selection method for text classification
High dimensionality of the feature space is one of the most important concerns in text
classification problems due to processing time and accuracy considerations. Selection of …
classification problems due to processing time and accuracy considerations. Selection of …
[HTML][HTML] A comprehensive study of spam detection in e-mails using bio-inspired optimization techniques
J Batra, R Jain, VA Tikkiwal, A Chakraborty - International Journal of …, 2021 - Elsevier
Electronic mail is a medium of communication used frequently for conveying a variety of
information. It has become an integral part of people's lives owing to its ease of access and …
information. It has become an integral part of people's lives owing to its ease of access and …
[KIRJA][B] Social data analytics
This book is an introduction to social data analytics along with its challenges and
opportunities in the age of Big Data and Artificial Intelligence. It focuses primarily on …
opportunities in the age of Big Data and Artificial Intelligence. It focuses primarily on …
Relative discrimination criterion–A novel feature ranking method for text data
High dimensionality of text data hinders the performance of classifiers making it necessary to
apply feature selection for dimensionality reduction. Most of the feature ranking metrics for …
apply feature selection for dimensionality reduction. Most of the feature ranking metrics for …