A brief survey of text mining: Classification, clustering and extraction techniques

M Allahyari, S Pouriyeh, M Assefi, S Safaei… - arxiv preprint arxiv …, 2017 - arxiv.org
The amount of text that is generated every day is increasing dramatically. This tremendous
volume of mostly unstructured text cannot be simply processed and perceived by computers …

Machine learning in automated text categorization

F Sebastiani - ACM computing surveys (CSUR), 2002 - dl.acm.org
The automated categorization (or classification) of texts into predefined categories has
witnessed a booming interest in the last 10 years, due to the increased availability of …

A label attention model for ICD coding from clinical text

T Vu, DQ Nguyen, A Nguyen - arxiv preprint arxiv:2007.06351, 2020 - arxiv.org
ICD coding is a process of assigning the International Classification of Disease diagnosis
codes to clinical/medical notes documented by health professionals (eg clinicians). This …

[BOG][B] Text data mining

C Zong, R **a, J Zhang - 2021 - Springer
With the rapid development and popularization of Internet and mobile communication
technologies, text data mining has attracted much attention. In particular, with the wide use …

[BOG][B] Quantum machine learning: what quantum computing means to data mining

P Wittek - 2014 - books.google.com
Quantum Machine Learning bridges the gap between abstract developments in quantum
computing and the applied research on machine learning. Paring down the complexity of the …

A survey of text classification algorithms

CC Aggarwal, CX Zhai - Mining text data, 2012 - Springer
The problem of classification has been widely studied in the data mining, machine learning,
database, and information retrieval communities with applications in a number of diverse …

ICD coding from clinical text using multi-filter residual convolutional neural network

F Li, H Yu - proceedings of the AAAI conference on artificial …, 2020 - ojs.aaai.org
Automated ICD coding, which assigns the International Classification of Disease codes to
patient visits, has attracted much research attention since it can save time and labor for …

[BOG][B] Modern information retrieval

R Baeza-Yates, B Ribeiro-Neto - 1999 - people.ischool.berkeley.edu
Information retrieval (IR) has changed considerably in recent years with the expansion of the
World Wide Web and the advent of modern and inexpensive graphical user interfaces and …

[PDF][PDF] A comparison of event models for naive bayes text classification

A McCallum, K Nigam - AAAI-98 workshop on learning for …, 1998 - yangli-feasibility.com
Recent approaches to text classification have used two different first-order probabilistic
models for classification, both of which make the naive Bayes assumption. Some use a multi …

Code synonyms do matter: Multiple synonyms matching network for automatic ICD coding

Z Yuan, C Tan, S Huang - arxiv preprint arxiv:2203.01515, 2022 - arxiv.org
Automatic ICD coding is defined as assigning disease codes to electronic medical records
(EMRs). Existing methods usually apply label attention with code representations to match …