Predicting the type and target of offensive posts in social media

M Zampieri, S Malmasi, P Nakov, S Rosenthal… - arXiv preprint arXiv …, 2019 - arxiv.org
As offensive content has become pervasive in social media, there has been much research
in identifying potentially offensive messages. However, previous work on this topic did not …

NADI 2022: The third nuanced Arabic dialect identification shared task

M Abdul-Mageed, C Zhang, AR Elmadany… - arXiv preprint arXiv …, 2022 - arxiv.org
We describe findings of the third Nuanced Arabic Dialect Identification Shared Task (NADI
2022). NADI aims at advancing state of the art Arabic NLP, including on Arabic dialects. It …

A panoramic survey of natural language processing in the Arab world

K Darwish, N Habash, M Abbas, H Al-Khalifa… - Communications of the …, 2021 - dl.acm.org
The term natural language refers to any system of symbolic communication (spoken,
signed, or written) that has evolved naturally in humans without intentional human planning …

Findings of the VarDial evaluation campaign 2023

N Aepli, Ç Çöltekin, R Van Der Goot… - arXiv preprint arXiv …, 2023 - arxiv.org
This report presents the results of the shared tasks organized as part of the VarDial
Evaluation Campaign 2023. The campaign is part of the tenth workshop on Natural …

A report on the third VarDial evaluation campaign

M Zampieri, S Malmasi, Y Scherrer… - Workshop on NLP …, 2019 - researchportal.helsinki.fi
In this paper, we present the findings of the Third VarDial Evaluation Campaign organized
as part of the sixth edition of the workshop on Natural Language Processing (NLP) for …

Findings of the VarDial evaluation campaign 2021

BR Chakravarthi, M Găman, RT Ionescu, H Jauhiainen… - EACL | VarDial, 2021 - orbilu.uni.lu
This paper describes the results of the shared tasks organized as part of the VarDial
Evaluation Campaign 2021. The campaign was part of the eighth workshop on Natural …

A report on the VarDial evaluation campaign 2020

M Gaman, D Hovy, RT Ionescu… - Proceedings of the …, 2020 - aclanthology.org
This paper presents the results of the VarDial Evaluation Campaign 2020 organized as part
of the seventh workshop on Natural Language Processing (NLP) for Similar Languages …

Natural language processing for similar languages, varieties, and dialects: A survey

M Zampieri, P Nakov, Y Scherrer - Natural Language Engineering, 2020 - cambridge.org
There has been a lot of recent interest in the natural language processing (NLP) community
in the computational processing of language varieties and dialects, with the aim to improve …

ADI17: A fine-grained Arabic dialect identification dataset

S Shon, A Ali, Y Samih, H Mubarak… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
In this paper, we describe a method to collect dialectal speech from YouTube videos to
create a large-scale Dialect Identification (DID) dataset. Using this method, we collected …

Language variety identification with true labels

M Zampieri, K North, T Jauhiainen, M Felice… - arXiv preprint arXiv …, 2023 - arxiv.org
Language identification is an important first step in many IR and NLP applications. Most
publicly available language identification datasets, however, are compiled under the …