BERT models for Arabic text classification: a systematic review

AS Alammary - Applied Sciences, 2022 - mdpi.com
Bidirectional Encoder Representations from Transformers (BERT) has gained increasing
attention from researchers and practitioners as it has proven to be an invaluable technique …

A comprehensive review on transformers models for text classification

R Kora, A Mohammed - 2023 International Mobile, Intelligent …, 2023 - ieeexplore.ieee.org
The rapid progress in deep learning has propelled transformer-based models to the
forefront, establishing them as leading solutions for multiple NLP tasks. These tasks span …

Improving the Detection of Multilingual Online Attacks with Rich Social Media Data from Singapore

J Haber, B Vidgen, M Chapman… - Proceedings of the …, 2023 - aclanthology.org
Toxic content is a global problem, but most resources for detecting toxic content are in
English. When datasets are created in other languages, they often focus exclusively on one …

Out of thin air: Is zero-shot cross-lingual keyword detection better than unsupervised?

B Koloski, S Pollak, B Škrlj, M Martinc - arXiv preprint arXiv:2202.06650, 2022 - arxiv.org
Keyword extraction is the task of retrieving words that are essential to the content of a given
document. Researchers proposed various approaches to tackle this problem. At the top …

Investigating toxicity changes of cross-community redditors from 2 billion posts and comments

H Almerekhi, H Kwak, BJ Jansen - PeerJ Computer Science, 2022 - peerj.com
This research investigates changes in online behavior of users who publish in multiple
communities on Reddit by measuring their toxicity at two levels. With the aid of …

Cordyceps@LT-EDI: Patching Language-Specific Homophobia/Transphobia Classifiers with a Multilingual Understanding

D Ninalga - arXiv preprint arXiv:2309.13561, 2023 - arxiv.org
Detecting transphobia, homophobia, and various other forms of hate speech is difficult.
Signals can vary depending on factors such as language, culture, geographical region, and …

Unmasking coordinated hate: Analysing hate speech on Spanish digital news media

S Arce-García, E Said-Hung… - New Media & …, 2024 - journals.sagepub.com
This study examines the characteristics and behaviours of accounts that propagate hate
speech through their responses to articles posted on five leading digital news media in …

Multilingual auxiliary tasks training: Bridging the gap between languages for zero-shot transfer of hate speech detection models

S Montariol, A Riabi, D Seddah - arXiv preprint arXiv:2210.13029, 2022 - arxiv.org
Zero-shot cross-lingual transfer learning has been shown to be highly challenging for tasks
involving a lot of linguistic specificities or when a cultural gap is present between languages …

Cross-lingual Offensive Language Detection: A Systematic Review of Datasets, Transfer Approaches and Challenges

A Jiang, A Zubiaga - arXiv preprint arXiv:2401.09244, 2024 - arxiv.org
The growing prevalence and rapid evolution of offensive language in social media amplify
the complexities of detection, particularly highlighting the challenges in identifying such …

HATE-ITA: Hate speech detection in Italian social media text

D Nozza, F Bianchi, G Attanasio - … of the Sixth Workshop on Online …, 2022 - aclanthology.org
Online hate speech is a dangerous phenomenon that can (and should) be promptly
counteracted properly. While Natural Language Processing supplies appropriate algorithms …