[PDF][PDF] Speech and language processing
D Jurafsky - 2000 - academia.edu
" This book is an absolute necessity for instructors at all levels, as well as an indispensible
reference for researchers. Introducing NLP, computational linguistics, and speech …
reference for researchers. Introducing NLP, computational linguistics, and speech …
Verb classification across languages
Recent developments in language modeling have enabled large text encoders to derive a
wealth of linguistic information from raw text corpora without supervision. Their success …
wealth of linguistic information from raw text corpora without supervision. Their success …
[PDF][PDF] Semeval-2013 task 12: Multilingual word sense disambiguation
This paper presents the SemEval-2013 task on multilingual Word Sense Disambiguation.
We describe our experience in producing a multilingual sense-annotated corpus for the task …
We describe our experience in producing a multilingual sense-annotated corpus for the task …
Breaking sticks and ambiguities with adaptive skip-gram
S Bartunov, D Kondrashkin… - artificial intelligence …, 2016 - proceedings.mlr.press
The recently proposed Skip-gram model is a powerful method for learning high-dimensional
word representations that capture rich semantic relationships between words. However …
word representations that capture rich semantic relationships between words. However …
Making sense of word embeddings
We present a simple yet effective approach for learning word sense embeddings. In contrast
to existing techniques, which either directly learn sense representations from corpora or rely …
to existing techniques, which either directly learn sense representations from corpora or rely …
[PDF][PDF] One million sense-tagged instances for word sense disambiguation and induction
Supervised word sense disambiguation (WSD) systems are usually the best performing
systems when evaluated on standard benchmarks. However, these systems need annotated …
systems when evaluated on standard benchmarks. However, these systems need annotated …
Characterizing English variation across social media communities with BERT
Much previous work characterizing language variation across Internet social groups has
focused on the types of words used by these groups. We extend this type of study by …
focused on the types of words used by these groups. We extend this type of study by …
Word sense induction with neural biLM and symmetric patterns
A Amrami, Y Goldberg - arxiv preprint arxiv:1808.08518, 2018 - arxiv.org
An established method for Word Sense Induction (WSI) uses a language model to predict
probable substitutes for target words, and induces senses by clustering these resulting …
probable substitutes for target words, and induces senses by clustering these resulting …
[PDF][PDF] Semeval-2016 task 14: Semantic taxonomy enrichment
Manually constructed taxonomies provide a crucial resource for many NLP technologies, yet
these resources are often limited in their lexical coverage due to their construction …
these resources are often limited in their lexical coverage due to their construction …
Always keep your target in mind: Studying semantics and improving performance of neural lexical substitution
N Arefyev, B Sheludko, A Podolskiy… - arxiv preprint arxiv …, 2022 - arxiv.org
Lexical substitution, ie generation of plausible words that can replace a particular target
word in a given context, is an extremely powerful technology that can be used as a …
word in a given context, is an extremely powerful technology that can be used as a …