A brief survey of text mining: Classification, clustering and extraction techniques

M Allahyari, S Pouriyeh, M Assefi, S Safaei… - arxiv preprint arxiv …, 2017 - arxiv.org
The amount of text that is generated every day is increasing dramatically. This tremendous
volume of mostly unstructured text cannot be simply processed and perceived by computers …

Machine learning in automated text categorization

F Sebastiani - ACM computing surveys (CSUR), 2002 - dl.acm.org
The automated categorization (or classification) of texts into predefined categories has
witnessed a booming interest in the last 10 years, due to the increased availability of …

Deep hierarchical semantic segmentation

L Li, T Zhou, W Wang, J Li… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Humans are able to recognize structured relations in observation, allowing us to decompose
complex scenes into simpler parts and abstract the visual world in multiple levels. However …

A survey of text classification algorithms

CC Aggarwal, CX Zhai - Mining text data, 2012 - Springer
The problem of classification has been widely studied in the data mining, machine learning,
database, and information retrieval communities with applications in a number of diverse …

A survey of hierarchical classification across different application domains

CN Silla, AA Freitas - Data mining and knowledge discovery, 2011 - Springer
In this survey we discuss the task of hierarchical classification. The literature about this field
is scattered across very different application domains and for that reason research in one …

[BOEK][B] An introduction to information retrieval

CD Manning - 2009 - edl.emi.gov.et
As recently as the 1990s, studies showed that most people preferred getting information
from other people rather than from information retrieval systems. Of course, in that time …

Semi-Supervised Learning (Chapelle, O. et al., Eds.; 2006) [Book reviews]

O Chapelle, B Scholkopf, A Zien - IEEE Transactions on Neural …, 2009 - ieeexplore.ieee.org
This book addresses some theoretical aspects of semisupervised learning (SSL). The book
is organized as a collection of different contributions of authors who are experts on this topic …

[PDF][PDF] A comparison of event models for naive bayes text classification

A McCallum, K Nigam - AAAI-98 workshop on learning for …, 1998 - yangli-feasibility.com
Recent approaches to text classification have used two different first-order probabilistic
models for classification, both of which make the naive Bayes assumption. Some use a multi …

[BOEK][B] The text mining handbook: advanced approaches in analyzing unstructured data

R Feldman, J Sanger - 2007 - books.google.com
Text mining is a new and exciting area of computer science research that tries to solve the
crisis of information overload by combining techniques from data mining, machine learning …

Content-based recommendation systems

MJ Pazzani, D Billsus - The adaptive web: methods and strategies of web …, 2007 - Springer
This chapter discusses content-based recommendation systems, ie, systems that
recommend an item to a user based upon a description of the item and a profile of the user's …