[BOOK][B] An introduction to information retrieval

CD Manning - 2009 - edl.emi.gov.et
As recently as the 1990s, studies showed that most people preferred getting information
from other people rather than from information retrieval systems. Of course, in that time …

[PDF][PDF] Speech and language processing

D Jurafsky - 2000 - academia.edu
" This book is an absolute necessity for instructors at all levels, as well as an indispensible
reference for researchers. Introducing NLP, computational linguistics, and speech …

A language modeling approach to information retrieval

JM Ponte, WB Croft - ACM SIGIR Forum, 2017 - dl.acm.org
Abstract Models of document indexing and docu-ment retrieval have been extensively
studied. The integration of these two classes of models has been the goal of several …

[BOOK][B] Handbook of natural language processing

N Indurkhya, FJ Damerau - 2010 - taylorfrancis.com
The Handbook of Natural Language Processing, Second Edition presents practical tools
and techniques for implementing natural language processing in computer systems. Along …

[BOOK][B] Topic detection and tracking: event-based information organization

J Allan - 2002 - books.google.com
The purposeofthis book is to providea recordofthe stateofthe art in Topic Detection and
Tracking (TDT) in a single place. Research in TDT has been going on for about five years …

The penn chinese treebank: Phrase structure annotation of a large corpus

N Xue, F **a, FD Chiou, M Palmer - Natural language engineering, 2005 - cambridge.org
With growing interest in Chinese Language Processing, numerous NLP tools (eg, word
segmenters, part-of-speech taggers, and parsers) for Chinese have been developed all over …

[PDF][PDF] Chinese word segmentation as character tagging

N Xue - International Journal of Computational Linguistics & …, 2003 - aclanthology.org
In this paper we report results of a supervised machine-learning approach to Chinese word
segmentation. A maximum entropy tagger is trained on manually annotated data to …

Good‐turing frequency estimation without tears

WA Gale, G Sampson - Journal of quantitative linguistics, 1995 - Taylor & Francis
Linguists and speech researchers who use statistical methods often need to estimate the
frequency of some type of item in a population containing items of various types. A common …

Normalization of non-standard words

R Sproat, AW Black, S Chen, S Kumar… - Computer speech & …, 2001 - Elsevier
In addition to ordinary words and names, real text contains non-standard “words"(NSWs),
including numbers, abbreviations, dates, currency amounts and acronyms. Typically, one …

Mining quality phrases from massive text corpora

J Liu, J Shang, C Wang, X Ren, J Han - Proceedings of the 2015 ACM …, 2015 - dl.acm.org
Text data are ubiquitous and play an essential role in big data applications. However, text
data are mostly unstructured. Transforming unstructured text into structured units (eg …