Natural language processing for dialects of a language: A survey

A Joshi, R Dabre, D Kanojia, Z Li, H Zhan… - ACM Computing …, 2025 - dl.acm.org
State-of-the-art natural language processing (NLP) models are trained on massive training
corpora, and report a superlative performance on evaluation datasets. This survey delves …

[HTML][HTML] Arabic natural language processing: An overview

I Guellil, H Saâdane, F Azouaou, B Gueni… - Journal of King Saud …, 2021 - Elsevier
Arabic is recognised as the 4th most used language of the Internet. Arabic has three main
varieties:(1) classical Arabic (CA),(2) Modern Standard Arabic (MSA),(3) Arabic Dialect (AD) …

Part-of-speech tagging for Arabic tweets using CRF and Bi-LSTM

W AlKhwiter, N Al-Twairesh - Computer Speech & Language, 2021 - Elsevier
Over the past few years, Twitter has experienced massive growth and the volume of its
online content has increased rapidly. This content has been a rich source for several studies …

[PDF][PDF] A morphologically annotated corpus of Emirati Arabic

S Khalifa, N Habash, F Eryani, O Obeid… - Proceedings of the …, 2018 - aclanthology.org
We present an ongoing effort on the first large-scale morphologically manually annotated
corpus of Emirati Arabic. This corpus includes about 200,000 words selected from eight …

Morphological analysis and disambiguation for Gulf Arabic: The interplay between resources and methods

S Khalifa, N Zalmout, N Habash - Proceedings of the Twelfth …, 2020 - aclanthology.org
In this paper we present the first full morphological analysis and disambiguation system for
Gulf Arabic. We use an existing state-of-the-art morphological disambiguation system to …

The Najdi Arabic Corpus: a new corpus for an underrepresented Arabic dialect

R Alhedayani - Language Resources and Evaluation, 2024 - Springer
This paper presents a new corpus for a dialect of Arabic spoken in the central region of
Saudi Arabia: the Najdi Arabic Corpus. This is the first publicly available corpus for this …

ADIDA: Automatic dialect identification for Arabic

O Obeid, M Salameh, H Bouamor… - Proceedings of the 2019 …, 2019 - aclanthology.org
This demo paper describes ADIDA, a web-based system for automatic dialect identification
for Arabic text. The system distinguishes among the dialects of 25 Arab cities (from Rabat to …

NLP for Enterprise Asset Management: An Emerging Paradigm

P Santos, N Datia, M Pato, J Sobral… - 2023 27th …, 2023 - ieeexplore.ieee.org
In the field of asset management, a Work Order refers to a document that outlines the
necessary steps to carry out a maintenance operation on a specific physical asset. The text …

Palmyra 2.0: A configurable multilingual platform independent tool for morphology and syntax annotation

D Taji, N Habash - Proceedings of the Fourth Workshop on …, 2020 - aclanthology.org
Abstract We present PALMYRA 2.0, a graphical dependency-tree visualization and editing
software. PALMYRA 2.0 is designed to be highly configurable to any dependency parsing …

[PDF][PDF] Computer and Information Sciences

I Guellil, H Saâdane, F Azouaou… - Journal of King Saud …, 2021 - damien.nouvels.net
abstract Arabic is recognised as the 4th most used language of the Internet. Arabic has three
main varieties:(1) classical Arabic (CA),(2) Modern Standard Arabic (MSA),(3) Arabic Dialect …