[PDF][PDF] What to do about bad language on the internet

J Eisenstein - Proceedings of the 2013 conference of the North …, 2013 - aclanthology.org
The rise of social media has brought computational linguistics in ever-closer contact with
bad language: text that defies our expectations about vocabulary, spelling, and syntax. This …

[PDF][PDF] Shared tasks of the 2015 workshop on noisy user-generated text: Twitter lexical normalization and named entity recognition

T Baldwin, MC De Marneffe, B Han… - Proceedings of the …, 2015 - aclanthology.org
This paper presents the results of the two shared tasks associated with W-NUT 2015:(1) a
text normalization task with 10 participants; and (2) a named entity tagging task with 8 …

Neural models of text normalization for speech applications

H Zhang, R Sproat, AH Ng, F Stahlberg… - Computational …, 2019 - direct.mit.edu
Abstract Machine learning, including neural network techniques, have been applied to
virtually every domain in natural language processing. One problem that has been …

Lexical normalization for social media text

B Han, P Cook, T Baldwin - … on Intelligent Systems and Technology (TIST …, 2013 - dl.acm.org
Twitter provides access to large volumes of data in real time, but is notoriously noisy,
hampering its utility for NLP. In this article, we target out-of-vocabulary words in short text …

[PDF][PDF] Automatically constructing a normalisation dictionary for microblogs

B Han, P Cook, T Baldwin - … of the 2012 joint conference on …, 2012 - aclanthology.org
Microblog normalisation methods often utilise complex models and struggle to differentiate
between correctly-spelled unknown words and lexical variants of known words. In this …

Streaming trend detection in twitter

J Benhardus, J Kalita - International Journal of Web Based …, 2013 - inderscienceonline.com
As social media continue to grow, the zeitgeist of society is increasingly found not in the
headlines of traditional media institutions, but in the activity of ordinary individuals. The …

RNN approaches to text normalization: A challenge

R Sproat, N Jaitly - arxiv preprint arxiv:1611.00068, 2016 - arxiv.org
This paper presents a challenge to the community: given a large corpus of written text
aligned to its normalized spoken form, train an RNN to learn the correct normalization …

[PDF][PDF] A broad-coverage normalization system for social media language

F Liu, F Weng, X Jiang - Proceedings of the 50th Annual Meeting …, 2012 - aclanthology.org
Social media language contains huge amount and wide variety of nonstandard tokens,
created both intentionally and unintentionally by the users. It is of crucial importance to …

A survey on syntactic processing techniques

X Zhang, R Mao, E Cambria - Artificial Intelligence Review, 2023 - Springer
Computational syntactic processing is a fundamental technique in natural language
processing. It normally serves as a pre-processing method to transform natural language …

Phonetic-based microtext normalization for twitter sentiment analysis

R Satapathy, C Guerreiro, I Chaturvedi… - … conference on data …, 2017 - ieeexplore.ieee.org
The proliferation of Web 2.0 technologies and the increasing use of computer-mediated
communication resulted in a new form of written text, termed microtext. This poses new …