What to do about bad language on the internet
J Eisenstein - Proceedings of the 2013 conference of the North …, 2013 - aclanthology.org
The rise of social media has brought computational linguistics in ever-closer contact with
bad language: text that defies our expectations about vocabulary, spelling, and syntax. This …
Shared tasks of the 2015 workshop on noisy user-generated text: Twitter lexical normalization and named entity recognition
This paper presents the results of the two shared tasks associated with W-NUT 2015: (1) a
text normalization task with 10 participants; and (2) a named entity tagging task with 8 …
Neural models of text normalization for speech applications
Machine learning, including neural network techniques, have been applied to
virtually every domain in natural language processing. One problem that has been …
Lexical normalization for social media text
Twitter provides access to large volumes of data in real time, but is notoriously noisy,
hampering its utility for NLP. In this article, we target out-of-vocabulary words in short text …
Automatically constructing a normalisation dictionary for microblogs
Microblog normalisation methods often utilise complex models and struggle to differentiate
between correctly-spelled unknown words and lexical variants of known words. In this …
Streaming trend detection in twitter
J Benhardus, J Kalita - International Journal of Web Based …, 2013 - inderscienceonline.com
As social media continue to grow, the zeitgeist of society is increasingly found not in the
headlines of traditional media institutions, but in the activity of ordinary individuals. The …
RNN approaches to text normalization: A challenge
This paper presents a challenge to the community: given a large corpus of written text
aligned to its normalized spoken form, train an RNN to learn the correct normalization …
A broad-coverage normalization system for social media language
Social media language contains huge amount and wide variety of nonstandard tokens,
created both intentionally and unintentionally by the users. It is of crucial importance to …
A survey on syntactic processing techniques
Computational syntactic processing is a fundamental technique in natural language
processing. It normally serves as a pre-processing method to transform natural language …
Phonetic-based microtext normalization for twitter sentiment analysis
The proliferation of Web 2.0 technologies and the increasing use of computer-mediated
communication resulted in a new form of written text, termed microtext. This poses new …