Sentiment analysis and the complex natural language

MT Khan, M Durrani, A Ali, I Inayat, S Khalid… - Complex Adaptive …, 2016 - Springer
There is huge amount of content produced online by amateur authors, covering a large
variety of topics. Sentiment analysis (SA) extracts and aggregates users' sentiments towards …

A Machine learning approach for Post-Disaster data curation

SH Ro, Y Li, J Gong - Advanced Engineering Informatics, 2024 - Elsevier
Image data collected after natural disasters play an important role in the forensics of
structure failures. However, curating and managing large amounts of post-disaster imagery …

Classifying web documents in a hierarchy of categories: a comprehensive study

M Ceci, D Malerba - Journal of Intelligent Information Systems, 2007 - Springer
Most of the research on text categorization has focused on classifying text documents into a
set of categories with no structural relationships among them (flat classification). However, in …

Effect of term distributions on centroid-based text categorization

V Lertnattee, T Theeramunkong - Information Sciences, 2004 - Elsevier
Most of traditional text categorization approaches utilize term frequency (tf) and inverse
document frequency (idf) for representing importance of words and/or terms in classifying a …

A high performance centroid-based classification approach for language identification

H Takçı, T Güngör - Pattern Recognition Letters, 2012 - Elsevier
Centroid-based classification is a machine learning approach used in the text classification
domain. The main advantage of centroid-based classifiers is their high performance during …

[PDF][PDF] Improving methods for single-label text categorization

A Cachopo - Instituto Superior Técnico, Portugal, 2007 - academia.edu
As the volume of information in digital form increases, the use of Text Categorization
techniques aimed at finding relevant information becomes more necessary. To improve the …

Semi-supervised single-label text categorization using centroid-based classifiers

A Cardoso-Cachopo, AL Oliveira - … of the 2007 ACM symposium on …, 2007 - dl.acm.org
In this paper we study the effect of using unlabeled data in conjunction with a small portion
of labeled data on the accuracy of a centroid-based classifier used to perform single-label …

A study on optimal parameter tuning for Rocchio text classifier

A Moschitti - European Conference on Information Retrieval, 2003 - Springer
Current trend in operational text categorization is the designing of fast classification tools.
Several studies on improving accuracy of fast but less accurate classifiers have been …

Improving feature selection techniques for machine learning

F Tan - 2007 - scholarworks.gsu.edu
As a commonly used technique in data preprocessing for machine learning, feature
selection identifies important features and removes irrelevant, redundant or noise features to …

Class normalization in centroid-based text categorization

V Lertnattee, T Theeramunkong - Information Sciences, 2006 - Elsevier
Centroid-based categorization is one of the most popular algorithms in text classification. In
this approach, normalization is an important factor to improve performance of a centroid …