Resources and benchmark corpora for hate speech detection: a systematic review
Hate Speech in social media is a complex phenomenon, whose detection has recently
gained significant traction in the Natural Language Processing community, as attested by …
gained significant traction in the Natural Language Processing community, as attested by …
A literature review of textual hate speech detection methods and datasets
F Alkomah, X Ma - Information, 2022 - mdpi.com
Online toxic discourses could result in conflicts between groups or harm to online
communities. Hate speech is complex and multifaceted harmful or offensive content …
communities. Hate speech is complex and multifaceted harmful or offensive content …
Hatebert: Retraining bert for abusive language detection in english
In this paper, we introduce HateBERT, a re-trained BERT model for abusive language
detection in English. The model was trained on RAL-E, a large-scale dataset of Reddit …
detection in English. The model was trained on RAL-E, a large-scale dataset of Reddit …
Latent hatred: A benchmark for understanding implicit hate speech
Hate speech has grown significantly on social media, causing serious consequences for
victims of all demographics. Despite much attention being paid to characterize and detect …
victims of all demographics. Despite much attention being paid to characterize and detect …
Towards generalisable hate speech detection: a review on obstacles and solutions
Hate speech is one type of harmful online content which directly attacks or promotes hate
towards a group or an individual member based on their actual or perceived aspects of …
towards a group or an individual member based on their actual or perceived aspects of …
Learning from the worst: Dynamically generated datasets to improve online hate detection
We present a human-and-model-in-the-loop process for dynamically generating datasets
and training better performing and more robust hate detection models. We provide a new …
and training better performing and more robust hate detection models. We provide a new …
Two contrasting data annotation paradigms for subjective NLP tasks
Labelled data is the foundation of most natural language processing tasks. However,
labelling data is difficult and there often are diverse valid beliefs about what the correct data …
labelling data is difficult and there often are diverse valid beliefs about what the correct data …
Recipes for safety in open-domain chatbots
Models trained on large unlabeled corpora of human interactions will learn patterns and
mimic behaviors therein, which include offensive or otherwise toxic behavior and unwanted …
mimic behaviors therein, which include offensive or otherwise toxic behavior and unwanted …
[PDF][PDF] SafetyKit: First aid for measuring safety in open-domain conversational systems
The social impact of natural language processing and its applications has received
increasing attention. In this position paper, we focus on the problem of safety for end-to-end …
increasing attention. In this position paper, we focus on the problem of safety for end-to-end …
[HTML][HTML] Offensive language identification in dravidian languages using mpnet and cnn
BR Chakravarthi, MB Jagadeeshan… - International Journal of …, 2023 - Elsevier
Social media has effectively replaced traditional forms of communication and marketing. As
these platforms allow for the free expression of ideas and facts through text, images, and …
these platforms allow for the free expression of ideas and facts through text, images, and …