[PDF][PDF] The MADAR Arabic dialect corpus and lexicon

H Bouamor, N Habash, M Salameh… - Proceedings of the …, 2018 - aclanthology.org
In this paper, we present two resources that were created as part of the Multi Arabic Dialect
Applications and Resources (MADAR) project. The first is a large parallel corpus of 25 …

Wojood: Nested arabic named entity corpus and recognition using bert

M Jarrar, M Khalilia, S Ghanem - arxiv preprint arxiv:2205.09651, 2022 - arxiv.org
This paper presents Wojood, a corpus for Arabic nested Named Entity Recognition (NER).
Nested entities occur when one entity mention is embedded inside another entity mention …

Curras: an annotated corpus for the Palestinian Arabic dialect

M Jarrar, N Habash, F Alrimawi, D Akra… - Language Resources …, 2017 - Springer
In this article we present Curras, the first morphologically annotated corpus of the
Palestinian Arabic dialect. Palestinian Arabic is one of the many primarily spoken dialects of …

Arabic fine-grained entity recognition

H Liqreina, M Jarrar, M Khalilia, AO El-Shangiti… - arxiv preprint arxiv …, 2023 - arxiv.org
Traditional NER systems are typically trained to recognize coarse-grained entities, and less
attention is given to classifying entities into a hierarchy of fine-grained lower-level subtypes …

The Arabic ontology–an Arabic wordnet with ontologically clean content

M Jarrar - Applied ontology, 2021 - content.iospress.com
We present a formal Arabic wordnet built on the basis of a carefully designed ontology
hereby referred to as the Arabic Ontology. The ontology provides a formal representation of …

Arbanking77: Intent detection neural model and a new dataset in modern and dialectical arabic

M Jarrar, A Birim, M Khalilia, M Erden… - arxiv preprint arxiv …, 2023 - arxiv.org
This paper presents the ArBanking77, a large Arabic dataset for intent detection in the
banking domain. Our dataset was arabized and localized from the original English …

[PDF][PDF] Shami: A corpus of levantine arabic dialects

KA Kwaik, M Saad, S Chatzikyriakidis… - Proceedings of the …, 2018 - aclanthology.org
Abstract Modern Standard Arabic (MSA) is the official language used in education and
media across the Arab world both in writing and formal speech. However, in daily …

Nabra: Syrian Arabic Dialects with Morphological Annotations

A Nayouf, T Hammouda, M Jarrar, F Zaraket… - arxiv preprint arxiv …, 2023 - arxiv.org
This paper presents Nabra, a corpora of Syrian Arabic dialects with morphological
annotations. A team of Syrian natives collected more than 6K sentences containing about …

[PDF][PDF] Natural language processing for dialectical Arabic: A survey

A Shoufan, S Alameri - Proceedings of the second workshop on …, 2015 - aclanthology.org
This paper presents a wide literature review of natural language processing for dialectical
Arabic. Four main research areas were identified and the dialect coverage in research work …

Curras+ baladi: Towards a levantine corpus

KE Haff, M Jarrar, T Hammouda, F Zaraket - arxiv preprint arxiv …, 2022 - arxiv.org
The processing of the Arabic language is a complex field of research. This is due to many
factors, including the complex and rich morphology of Arabic, its high degree of ambiguity …