[BOOK] An introduction to information retrieval
CD Manning - 2009 - edl.emi.gov.et
As recently as the 1990s, studies showed that most people preferred getting information
from other people rather than from information retrieval systems. Of course, in that time …
[PDF] Speech and language processing
D Jurafsky - 2000 - academia.edu
"This book is an absolute necessity for instructors at all levels, as well as an indispensable
reference for researchers. Introducing NLP, computational linguistics, and speech …
A language modeling approach to information retrieval
JM Ponte, WB Croft - ACM SIGIR Forum, 2017 - dl.acm.org
Abstract Models of document indexing and document retrieval have been extensively
studied. The integration of these two classes of models has been the goal of several …
[BOOK] Handbook of natural language processing
N Indurkhya, FJ Damerau - 2010 - taylorfrancis.com
The Handbook of Natural Language Processing, Second Edition presents practical tools
and techniques for implementing natural language processing in computer systems. Along …
[BOOK] Topic detection and tracking: event-based information organization
J Allan - 2002 - books.google.com
The purpose of this book is to provide a record of the state of the art in Topic Detection and
Tracking (TDT) in a single place. Research in TDT has been going on for about five years …
The Penn Chinese Treebank: Phrase structure annotation of a large corpus
With growing interest in Chinese Language Processing, numerous NLP tools (e.g., word
segmenters, part-of-speech taggers, and parsers) for Chinese have been developed all over …
[PDF] Chinese word segmentation as character tagging
N Xue - International Journal of Computational Linguistics & …, 2003 - aclanthology.org
In this paper we report results of a supervised machine-learning approach to Chinese word
segmentation. A maximum entropy tagger is trained on manually annotated data to …
Good-Turing frequency estimation without tears
WA Gale, G Sampson - Journal of quantitative linguistics, 1995 - Taylor & Francis
Linguists and speech researchers who use statistical methods often need to estimate the
frequency of some type of item in a population containing items of various types. A common …
Normalization of non-standard words
In addition to ordinary words and names, real text contains non-standard "words" (NSWs),
including numbers, abbreviations, dates, currency amounts and acronyms. Typically, one …
Mining quality phrases from massive text corpora
Text data are ubiquitous and play an essential role in big data applications. However, text
data are mostly unstructured. Transforming unstructured text into structured units (e.g. …