Lexicon annotation in sentiment analysis for dialectal Arabic: Systematic review of current trends and future directions
Due to the vast volumes of newly streamed data on the Internet and social media, the use of
sentiment analysis (SA) to extract information and analyze people's opinions has become a …
sentiment analysis (SA) to extract information and analyze people's opinions has become a …
[HTML][HTML] Arabic natural language processing: An overview
Arabic is recognised as the 4th most used language of the Internet. Arabic has three main
varieties:(1) classical Arabic (CA),(2) Modern Standard Arabic (MSA),(3) Arabic Dialect (AD) …
varieties:(1) classical Arabic (CA),(2) Modern Standard Arabic (MSA),(3) Arabic Dialect (AD) …
Language model tokenizers introduce unfairness between languages
Recent language models have shown impressive multilingual performance, even when not
explicitly trained for it. Despite this, there are concerns about the quality of their outputs …
explicitly trained for it. Despite this, there are concerns about the quality of their outputs …
The interplay of variant, size, and task type in Arabic pre-trained language models
In this paper, we explore the effects of language variants, data sizes, and fine-tuning task
types in Arabic pre-trained language models. To do so, we build three pre-trained language …
types in Arabic pre-trained language models. To do so, we build three pre-trained language …
CAMeL tools: An open source python toolkit for Arabic natural language processing
Abstract We present CAMeL Tools, a collection of open-source tools for Arabic natural
language processing in Python. CAMeL Tools currently provides utilities for pre-processing …
language processing in Python. CAMeL Tools currently provides utilities for pre-processing …
AraT5: Text-to-text transformers for Arabic language generation
Transfer learning with a unified Transformer framework (T5) that converts all language
problems into a text-to-text format was recently proposed as a simple and effective transfer …
problems into a text-to-text format was recently proposed as a simple and effective transfer …
NADI 2022: The third nuanced Arabic dialect identification shared task
We describe findings of the third Nuanced Arabic Dialect Identification Shared Task (NADI
2022). NADI aims at advancing state of the art Arabic NLP, including on Arabic dialects. It …
2022). NADI aims at advancing state of the art Arabic NLP, including on Arabic dialects. It …
Natural language processing for dialects of a language: A survey
State-of-the-art natural language processing (NLP) models are trained on massive training
corpora, and report a superlative performance on evaluation datasets. This survey delves …
corpora, and report a superlative performance on evaluation datasets. This survey delves …
The MADAR shared task on Arabic fine-grained dialect identification
In this paper, we present the results and findings of the MADAR Shared Task on Arabic Fine-
Grained Dialect Identification. This shared task was organized as part of The Fourth Arabic …
Grained Dialect Identification. This shared task was organized as part of The Fourth Arabic …
A panoramic survey of natural language processing in the Arab world
THE TERM NATURAL language refers to any system of symbolic communication (spoken,
signed, or written) that has evolved naturally in humans without intentional human planning …
signed, or written) that has evolved naturally in humans without intentional human planning …