[HTML][HTML] A systematic review of hate speech automatic detection using natural language processing

MS Jahan, M Oussalah - Neurocomputing, 2023 - Elsevier
With the multiplication of social media platforms, which offer anonymity, easy access and
online community formation and online debate, the issue of hate speech detection and …

A literature review of textual hate speech detection methods and datasets

F Alkomah, X Ma - Information, 2022 - mdpi.com
Online toxic discourses could result in conflicts between groups or harm to online
communities. Hate speech is complex and multifaceted harmful or offensive content …

Persistent interaction patterns across social media platforms and over time

M Avalle, N Di Marco, G Etta, E Sangiorgio, S Alipour… - Nature, 2024 - nature.com
Growing concern surrounds the impact of social media platforms on public discourse,,–and
their influence on social dynamics,,,–, especially in the context of toxicity,–. Here, to better …

Toxicchat: Unveiling hidden challenges of toxicity detection in real-world user-ai conversation

Z Lin, Z Wang, Y Tong, Y Wang, Y Guo, Y Wang… - arxiv preprint arxiv …, 2023 - arxiv.org
Despite remarkable advances that large language models have achieved in chatbots,
maintaining a non-toxic user-AI interactive environment has become increasingly critical …

Towards generalisable hate speech detection: a review on obstacles and solutions

W Yin, A Zubiaga - PeerJ Computer Science, 2021 - peerj.com
Hate speech is one type of harmful online content which directly attacks or promotes hate
towards a group or an individual member based on their actual or perceived aspects of …

Overview of the hasoc subtrack at fire 2021: Hate speech and offensive content identification in english and indo-aryan languages and conversational hate speech

S Modha, T Mandl, GK Shahi, H Madhu… - Proceedings of the 13th …, 2021 - dl.acm.org
The HASOC track is dedicated to the evaluation of technology for finding Offensive
Language and Hate Speech. HASOC is creating a multilingual data corpus mainly for …

Handling bias in toxic speech detection: A survey

T Garg, S Masud, T Suresh, T Chakraborty - ACM Computing Surveys, 2023 - dl.acm.org
Detecting online toxicity has always been a challenge due to its inherent subjectivity. Factors
such as the context, geography, socio-political climate, and background of the producers …

An expert annotated dataset for the detection of online misogyny

E Guest, B Vidgen, A Mittos, N Sastry… - Proceedings of the …, 2021 - aclanthology.org
Online misogyny is a pernicious social problem that risks making online platforms toxic and
unwelcoming to women. We present a new hierarchical taxonomy for online misogyny, as …

Ethical reasoning over moral alignment: A case and framework for in-context ethical policies in LLMs

A Rao, A Khandelwal, K Tanmay, U Agarwal… - arxiv preprint arxiv …, 2023 - arxiv.org
In this position paper, we argue that instead of morally aligning LLMs to specific set of ethical
principles, we should infuse generic ethical reasoning capabilities into them so that they can …

[PDF][PDF] SafetyKit: First aid for measuring safety in open-domain conversational systems

E Dinan, G Abercrombie, SA Bergman… - Proceedings of the …, 2022 - iris.unibocconi.it
The social impact of natural language processing and its applications has received
increasing attention. In this position paper, we focus on the problem of safety for end-to-end …