Resources and benchmark corpora for hate speech detection: a systematic review

F Poletto, V Basile, M Sanguinetti, C Bosco… - Language Resources …, 2021 - Springer
Hate Speech in social media is a complex phenomenon, whose detection has recently
gained significant traction in the Natural Language Processing community, as attested by …

A literature review of textual hate speech detection methods and datasets

F Alkomah, X Ma - Information, 2022 - mdpi.com
Online toxic discourses could result in conflicts between groups or harm to online
communities. Hate speech is complex and multifaceted harmful or offensive content …

Hatebert: Retraining bert for abusive language detection in english

T Caselli, V Basile, J Mitrović, M Granitzer - arxiv preprint arxiv …, 2020 - arxiv.org
In this paper, we introduce HateBERT, a re-trained BERT model for abusive language
detection in English. The model was trained on RAL-E, a large-scale dataset of Reddit …

Latent hatred: A benchmark for understanding implicit hate speech

M ElSherief, C Ziems, D Muchlinski, V Anupindi… - arxiv preprint arxiv …, 2021 - arxiv.org
Hate speech has grown significantly on social media, causing serious consequences for
victims of all demographics. Despite much attention being paid to characterize and detect …

Towards generalisable hate speech detection: a review on obstacles and solutions

W Yin, A Zubiaga - PeerJ Computer Science, 2021 - peerj.com
Hate speech is one type of harmful online content which directly attacks or promotes hate
towards a group or an individual member based on their actual or perceived aspects of …

Learning from the worst: Dynamically generated datasets to improve online hate detection

B Vidgen, T Thrush, Z Waseem, D Kiela - arxiv preprint arxiv:2012.15761, 2020 - arxiv.org
We present a human-and-model-in-the-loop process for dynamically generating datasets
and training better performing and more robust hate detection models. We provide a new …

Two contrasting data annotation paradigms for subjective NLP tasks

P Röttger, B Vidgen, D Hovy… - arxiv preprint arxiv …, 2021 - arxiv.org
Labelled data is the foundation of most natural language processing tasks. However,
labelling data is difficult and there often are diverse valid beliefs about what the correct data …

Recipes for safety in open-domain chatbots

J Xu, D Ju, M Li, YL Boureau, J Weston… - arxiv preprint arxiv …, 2020 - arxiv.org
Models trained on large unlabeled corpora of human interactions will learn patterns and
mimic behaviors therein, which include offensive or otherwise toxic behavior and unwanted …

[PDF][PDF] SafetyKit: First aid for measuring safety in open-domain conversational systems

E Dinan, G Abercrombie, SA Bergman… - Proceedings of the …, 2022 - iris.unibocconi.it
The social impact of natural language processing and its applications has received
increasing attention. In this position paper, we focus on the problem of safety for end-to-end …

[HTML][HTML] Offensive language identification in dravidian languages using mpnet and cnn

BR Chakravarthi, MB Jagadeeshan… - International Journal of …, 2023 - Elsevier
Social media has effectively replaced traditional forms of communication and marketing. As
these platforms allow for the free expression of ideas and facts through text, images, and …