A comparative study on feature selection in text categorization

Y Yang, JO Pedersen - ICML, 1997 - Citeseer
This paper is a comparative study of feature selection methods in statistical learning of text
categorization. The focus is on aggressive dimensionality reduction. Five methods were …

Machine learning for text: An introduction

CC Aggarwal - 2018 - Springer
The extraction of useful insights from text with various types of statistical algorithms is
referred to as text mining, text analytics, or machine learning from text. The choice of …

A survey of text clustering algorithms

CC Aggarwal, CX Zhai - Mining text data, 2012 - Springer
Clustering is a widely studied data mining problem in the text domains. The problem finds
numerous applications in customer segmentation, classification, collaborative filtering …

Multi-class sentiment classification on Bengali social media comments using machine learning

R Haque, N Islam, M Tasneem, AK Das - International journal of cognitive …, 2023 - Elsevier
Multi-class Sentiment Analysis (SA) is an important field of computational linguistics
that extracts multiple opinions expressed in a text using NLP and text-mining techniques …

Temporal patterns of happiness and information in a global social network: Hedonometrics and Twitter

PS Dodds, KD Harris, IM Kloumann, CA Bliss… - PLoS ONE, 2011 - journals.plos.org
Individual happiness is a fundamental societal metric. Normally measured through self-report,
happiness has often been indirectly characterized and overshadowed by more …

Essential elements of natural language processing: what the radiologist should know

PH Chen - Academic radiology, 2020 - Elsevier
Natural language is ubiquitous in the workflow of medical imaging. Radiologists create and
consume free text in their daily work, some of which can be amenable to enhancements …

Machine learning in medicine: a practical introduction to natural language processing

CJ Harrison, CJ Sidey-Gibbons - BMC medical research methodology, 2021 - Springer
Background: Unstructured text, including medical records, patient feedback, and social
media comments, can be a rich source of data for clinical research. Natural language …

Stopwords in technical language processing

S Sarica, J Luo - PLoS ONE, 2021 - journals.plos.org
There are increasing applications of natural language processing techniques for information
retrieval, indexing, topic modelling and text classification in engineering contexts. A standard …

Temporal topic modeling applied to aviation safety reports: A subject matter expert review

SD Robinson - Safety science, 2019 - Elsevier
The aviation safety reporting system database has seen many applications of topic modeling
and natural language processing. Its size, metadata, and narratives have made it ideal for …

Comparing grounded theory and topic modeling: Extreme divergence or unlikely convergence?

EPS Baumer, D Mimno, S Guha… - Journal of the …, 2017 - Wiley Online Library
Researchers in information science and related areas have developed various methods for
analyzing textual data, such as survey responses. This article describes the application of …