Automatic language identification in texts: A survey

T Jauhiainen, M Lui, M Zampieri, T Baldwin… - Journal of Artificial …, 2019 - jair.org
Language identification ("LI") is the problem of determining the natural language that a
document or part thereof is written in. Automatic LI has been extensively researched for over …

Feature extraction methods in language identification: a survey

D Deshwal, P Sangwan, D Kumar - Wireless Personal Communications, 2019 - Springer
Language Identification (LI) is one of the widely emerging field in the areas of
speech processing to accurately identify the language from the data base based on some …

Overview of the DSL shared task 2015

M Zampieri, L Tan, N Ljubešić… - Proceedings of the …, 2015 - aclanthology.org
We present the results of the 2nd edition of the Discriminating between Similar Languages
(DSL) shared task, which was organized as part of the LT4VarDial'2015 workshop and …

A survey on multi-modal social event detection

H Zhou, H Yin, H Zheng, Y Li - Knowledge-Based Systems, 2020 - Elsevier
Due to the prevalence of social media sites, users are allowed to conveniently share their
ideas and activities anytime and anywhere. Therefore, these sites hold substantial real …

Language identification using classifier ensembles

S Malmasi, M Dras - Proceedings of the joint workshop on …, 2015 - aclanthology.org
In this paper we describe the language identification system we developed for the
Discriminating Similar Languages (DSL) 2015 shared task. We constructed a classifier …

Arabic dialect identification in speech transcripts

S Malmasi, M Zampieri - Proceedings of the Third Workshop on …, 2016 - aclanthology.org
In this paper we describe a system developed to identify a set of four regional Arabic dialects
(Egyptian, Gulf, Levantine, North African) and Modern Standard Arabic (MSA) in a …

Chinese dialect speech recognition: a comprehensive survey

Q Li, Q Mai, M Wang, M Ma - Artificial Intelligence Review, 2024 - Springer
As a multi-ethnic country with a large population, China is endowed with diverse dialects,
which brings considerable challenges to speech recognition work. In fact, due to …

Sociolinguistically driven approaches for just natural language processing

SL Blodgett - 2021 - scholarworks.umass.edu
Natural language processing (NLP) systems are now ubiquitous. Yet the benefits of these
language technologies do not accrue evenly to all users, and indeed they can be harmful; …

Discriminating similar languages: Evaluations and explorations

C Goutte, S Léger, S Malmasi, M Zampieri - arXiv preprint arXiv …, 2016 - arxiv.org
We present an analysis of the performance of machine learning classifiers on discriminating
between similar languages and language varieties. We carried out a number of experiments …