[PDF][PDF] The MADAR Arabic dialect corpus and lexicon
In this paper, we present two resources that were created as part of the Multi Arabic Dialect
Applications and Resources (MADAR) project. The first is a large parallel corpus of 25 …
Applications and Resources (MADAR) project. The first is a large parallel corpus of 25 …
Wojood: Nested arabic named entity corpus and recognition using bert
This paper presents Wojood, a corpus for Arabic nested Named Entity Recognition (NER).
Nested entities occur when one entity mention is embedded inside another entity mention …
Nested entities occur when one entity mention is embedded inside another entity mention …
Curras: an annotated corpus for the Palestinian Arabic dialect
In this article we present Curras, the first morphologically annotated corpus of the
Palestinian Arabic dialect. Palestinian Arabic is one of the many primarily spoken dialects of …
Palestinian Arabic dialect. Palestinian Arabic is one of the many primarily spoken dialects of …
Arabic fine-grained entity recognition
Traditional NER systems are typically trained to recognize coarse-grained entities, and less
attention is given to classifying entities into a hierarchy of fine-grained lower-level subtypes …
attention is given to classifying entities into a hierarchy of fine-grained lower-level subtypes …
The Arabic ontology–an Arabic wordnet with ontologically clean content
M Jarrar - Applied ontology, 2021 - content.iospress.com
We present a formal Arabic wordnet built on the basis of a carefully designed ontology
hereby referred to as the Arabic Ontology. The ontology provides a formal representation of …
hereby referred to as the Arabic Ontology. The ontology provides a formal representation of …
Arbanking77: Intent detection neural model and a new dataset in modern and dialectical arabic
This paper presents the ArBanking77, a large Arabic dataset for intent detection in the
banking domain. Our dataset was arabized and localized from the original English …
banking domain. Our dataset was arabized and localized from the original English …
[PDF][PDF] Shami: A corpus of levantine arabic dialects
Abstract Modern Standard Arabic (MSA) is the official language used in education and
media across the Arab world both in writing and formal speech. However, in daily …
media across the Arab world both in writing and formal speech. However, in daily …
Nabra: Syrian Arabic Dialects with Morphological Annotations
This paper presents Nabra, a corpora of Syrian Arabic dialects with morphological
annotations. A team of Syrian natives collected more than 6K sentences containing about …
annotations. A team of Syrian natives collected more than 6K sentences containing about …
[PDF][PDF] Natural language processing for dialectical Arabic: A survey
A Shoufan, S Alameri - Proceedings of the second workshop on …, 2015 - aclanthology.org
This paper presents a wide literature review of natural language processing for dialectical
Arabic. Four main research areas were identified and the dialect coverage in research work …
Arabic. Four main research areas were identified and the dialect coverage in research work …
Curras+ baladi: Towards a levantine corpus
The processing of the Arabic language is a complex field of research. This is due to many
factors, including the complex and rich morphology of Arabic, its high degree of ambiguity …
factors, including the complex and rich morphology of Arabic, its high degree of ambiguity …