Natural language processing for dialects of a language: A survey
State-of-the-art natural language processing (NLP) models are trained on massive training
corpora, and report a superlative performance on evaluation datasets. This survey delves …
corpora, and report a superlative performance on evaluation datasets. This survey delves …
Camelira: An Arabic multi-dialect morphological disambiguator
We present Camelira, a web-based Arabic multi-dialect morphological disambiguation tool
that covers four major variants of Arabic: Modern Standard Arabic, Egyptian, Gulf, and …
that covers four major variants of Arabic: Modern Standard Arabic, Egyptian, Gulf, and …
Morphotactic modeling in an open-source multi-dialectal Arabic morphological analyzer and generator
Arabic is a morphologically rich and complex language, with numerous dialectal variants.
Previous efforts on Arabic morphology modeling focused on specific variants and specific …
Previous efforts on Arabic morphology modeling focused on specific variants and specific …
The Najdi Arabic Corpus: a new corpus for an underrepresented Arabic dialect
R Alhedayani - Language Resources and Evaluation, 2024 - Springer
This paper presents a new corpus for a dialect of Arabic spoken in the central region of
Saudi Arabia: the Najdi Arabic Corpus. This is the first publicly available corpus for this …
Saudi Arabia: the Najdi Arabic Corpus. This is the first publicly available corpus for this …
Transformers on multilingual clause-level morphology
This paper describes our winning systems in MRL: The 1st Shared Task on Multilingual
Clause-level Morphology (EMNLP 2022 Workshop) designed by KUIS AI NLP team. We …
Clause-level Morphology (EMNLP 2022 Workshop) designed by KUIS AI NLP team. We …
ALMA: Fast Lemmatizer and POS Tagger for Arabic
We introduce Alma (), an open-source and state-of-the-art lemmatizer, POS tagger, and root
tagger for Arabic, boasting both high speed and accuracy. Alma relies on a dictionary of …
tagger for Arabic, boasting both high speed and accuracy. Alma relies on a dictionary of …
Strategies for Arabic Readability Modeling
Automatic readability assessment is relevant to building NLP applications for education,
content analysis, and accessibility. However, Arabic readability assessment is a challenging …
content analysis, and accessibility. However, Arabic readability assessment is a challenging …
Computational Morphology and Lexicography Modeling of Modern Standard Arabic Nominals
Modern Standard Arabic (MSA) nominals present many morphological and lexical modeling
challenges that have not been consistently addressed previously. This paper attempts to …
challenges that have not been consistently addressed previously. This paper attempts to …
Deep Active Learning for Morphophonological Processing
Building a system for morphological processing is a challenging task in morphologically
complex languages like Arabic. Although there are some deep learning based models that …
complex languages like Arabic. Although there are some deep learning based models that …
Advancements in Arabic grammatical error detection and correction: An empirical investigation
Grammatical error correction (GEC) is a well-explored problem in English with many existing
models and datasets. However, research on GEC in morphologically rich languages has …
models and datasets. However, research on GEC in morphologically rich languages has …