Demographic dialectal variation in social media: A case study of African-American English
SL Blodgett, L Green, B O'Connor - ar** NLP tools to handle such language. We conduct a case study of dialectal …
Automatic language identification in texts: A survey
Language identification (" LI") is the problem of determining the natural language that a
document or part thereof is written in. Automatic LI has been extensively researched for over …
document or part thereof is written in. Automatic LI has been extensively researched for over …
Feature extraction methods in language identification: a survey
Abstract Language Identification (LI) is one of the widely emerging field in the areas of
speech processing to accurately identify the language from the data base based on some …
speech processing to accurately identify the language from the data base based on some …
[PDF][PDF] Overview of the DSL shared task 2015
We present the results of the 2nd edition of the Discriminating between Similar Languages
(DSL) shared task, which was organized as part of the LT4VarDial'2015 workshop and …
(DSL) shared task, which was organized as part of the LT4VarDial'2015 workshop and …
A survey on multi-modal social event detection
H Zhou, H Yin, H Zheng, Y Li - Knowledge-Based Systems, 2020 - Elsevier
Due to the prevalence of social media sites, users are allowed to conveniently share their
ideas and activities anytime and anywhere. Therefore, these sites hold substantial real …
ideas and activities anytime and anywhere. Therefore, these sites hold substantial real …
[PDF][PDF] Language identification using classifier ensembles
In this paper we describe the language identification system we developed for the
Discriminating Similar Languages (DSL) 2015 shared task. We constructed a classifier …
Discriminating Similar Languages (DSL) 2015 shared task. We constructed a classifier …
Arabic dialect identification in speech transcripts
In this paper we describe a system developed to identify a set of four regional Arabic dialects
(Egyptian, Gulf, Levantine, North African) and Modern Standard Arabic (MSA) in a …
(Egyptian, Gulf, Levantine, North African) and Modern Standard Arabic (MSA) in a …
Chinese dialect speech recognition: a comprehensive survey
Q Li, Q Mai, M Wang, M Ma - Artificial Intelligence Review, 2024 - Springer
As a multi-ethnic country with a large population, China is endowed with diverse dialects,
which brings considerable challenges to speech recognition work. In fact, due to …
which brings considerable challenges to speech recognition work. In fact, due to …
Sociolinguistically driven approaches for just natural language processing
SL Blodgett - 2021 - scholarworks.umass.edu
Natural language processing (NLP) systems are now ubiquitous. Yet the benefits of these
language technologies do not accrue evenly to all users, and indeed they can be harmful; …
language technologies do not accrue evenly to all users, and indeed they can be harmful; …
Discriminating similar languages: Evaluations and explorations
We present an analysis of the performance of machine learning classifiers on discriminating
between similar languages and language varieties. We carried out a number of experiments …
between similar languages and language varieties. We carried out a number of experiments …