A literature review of textual hate speech detection methods and datasets

F Alkomah, X Ma - Information, 2022 - mdpi.com
Online toxic discourses could result in conflicts between groups or harm to online
communities. Hate speech is complex and multifaceted harmful or offensive content …

[HTML][HTML] A systematic review of hate speech automatic detection using natural language processing

MS Jahan, M Oussalah - Neurocomputing, 2023 - Elsevier
With the multiplication of social media platforms, which offer anonymity, easy access and
online community formation and online debate, the issue of hate speech detection and …

Into the laion's den: Investigating hate in multimodal datasets

A Birhane, S Han, V Boddeti… - Advances in neural …, 2023 - proceedings.neurips.cc
AbstractScale the model, scale the data, scale the compute'is the reigning sentiment in the
world of generative AI today. While the impact of model scaling has been extensively …

Dynabench: Rethinking benchmarking in NLP

D Kiela, M Bartolo, Y Nie, D Kaushik, A Geiger… - arxiv preprint arxiv …, 2021 - arxiv.org
We introduce Dynabench, an open-source platform for dynamic dataset creation and model
benchmarking. Dynabench runs in a web browser and supports human-and-model-in-the …

Hatebert: Retraining bert for abusive language detection in english

T Caselli, V Basile, J Mitrović, M Granitzer - arxiv preprint arxiv …, 2020 - arxiv.org
In this paper, we introduce HateBERT, a re-trained BERT model for abusive language
detection in English. The model was trained on RAL-E, a large-scale dataset of Reddit …

Overview of the hasoc track at fire 2019: Hate speech and offensive content identification in indo-european languages

T Mandl, S Modha, P Majumder, D Patel… - Proceedings of the 11th …, 2019 - dl.acm.org
The identification of Hate Speech in Social Media is of great importance and receives much
attention in the text classification community. There is a huge demand for research for …

HateCheck: Functional tests for hate speech detection models

P Röttger, B Vidgen, D Nguyen, Z Waseem… - arxiv preprint arxiv …, 2020 - arxiv.org
Detecting online hate is a difficult task that even state-of-the-art models struggle with.
Typically, hate speech detection models are evaluated by measuring their performance on …

Towards generalisable hate speech detection: a review on obstacles and solutions

W Yin, A Zubiaga - PeerJ Computer Science, 2021 - peerj.com
Hate speech is one type of harmful online content which directly attacks or promotes hate
towards a group or an individual member based on their actual or perceived aspects of …

Learning from the worst: Dynamically generated datasets to improve online hate detection

B Vidgen, T Thrush, Z Waseem, D Kiela - arxiv preprint arxiv:2012.15761, 2020 - arxiv.org
We present a human-and-model-in-the-loop process for dynamically generating datasets
and training better performing and more robust hate detection models. We provide a new …

Overview of the hasoc track at fire 2020: Hate speech and offensive language identification in tamil, malayalam, hindi, english and german

T Mandl, S Modha, A Kumar M… - Proceedings of the 12th …, 2020 - dl.acm.org
This paper presents the HASOC track and its two parts. HASOC is dedicated to evaluate
technology for finding Offensive Language and Hate Speech. HASOC is creating test …