A survey on hate speech detection and sentiment analysis using machine learning and deep learning models

M Subramanian, VE Sathiskumar… - Alexandria Engineering …, 2023 - Elsevier
In today's digital era, the rise of hate speech has emerged as a critical concern, driven by the
rapid information-sharing capabilities of social media platforms and online communities. As …

Data representativity for machine learning and AI systems

LH Clemmensen, RD Kjærsgaard - arXiv preprint arXiv:2203.04706, 2022 - arxiv.org
Data representativity is crucial when drawing inference from data through machine learning
models. Scholars have increased focus on unraveling the bias and fairness in models, also …

SOLD: Sinhala offensive language dataset

T Ranasinghe, I Anuradha, D Premasiri, K Silva… - Language Resources …, 2024 - Springer
The widespread of offensive content online, such as hate speech and cyber-bullying, is a
global phenomenon. This has sparked interest in the artificial intelligence (AI) and natural …

Eyes don't lie: Subjective hate annotation and detection with gaze

Ö Alaçam, S Hoeken, S Zarrieß - Proceedings of the 2024 …, 2024 - aclanthology.org
Hate speech is a complex and subjective phenomenon. In this paper, we present a dataset
(GAZE4HATE) that provides gaze data collected in a hate speech annotation experiment …

Improving Adversarial Data Collection by Supporting Annotators: Lessons from GAHD, a German Hate Speech Dataset

J Goldzycher, P Röttger, G Schneider - arXiv preprint arXiv:2403.19559, 2024 - arxiv.org
Hate speech detection models are only as good as the data they are trained on. Datasets
sourced from social media suffer from systematic gaps and biases, leading to unreliable …

STAND-Guard: A Small Task-Adaptive Content Moderation Model

M Wang, P Lin, S Cai, S An, S Ma, Z Lin… - arXiv preprint arXiv …, 2024 - arxiv.org
Content moderation, the process of reviewing and monitoring the safety of generated
content, is important for development of welcoming online platforms and responsible large …

Demarked: A Strategy for Enhanced Abusive Speech Moderation through Counterspeech, Detoxification, and Message Management

SM Yimam, D Dementieva, T Fischer… - arXiv preprint arXiv …, 2024 - arxiv.org
Despite regulations imposed by nations and social media platforms, such as recent EU
regulations targeting digital violence, abusive content persists as a significant challenge …

Towards an Organically Growing Hate Speech Dataset in Hate Speech Detection Systems in a Smart Mobility Application

A Alsamman, A Schmitz, MA Wimmer - Proceedings of the 24th Annual …, 2023 - dl.acm.org
The automatic detection of hate speech online poses several challenges. A top challenge is
that hate speech changes its targets and its format periodically. While the lack of available …

GMHP7k: A Corpus of German Misogynistic Hatespeech Posts

J Glasebach, ME Keller, A Döschl… - Proceedings of the …, 2024 - ojs.aaai.org
We provide a German corpus consisting of 7,061 posts authored by users of social media
platforms. A group of volunteers annotated each post according to hatespeech and …

HOCON34k: A Corpus of Hate Speech in Online Comments from German Newspapers

ME Keller, M Auch, A Döschl, F Vlk… - … Integration and Web …, 2025 - Springer
We present a dataset of 34,223 comments in German, authored by users of online platforms
associated with public discourse in German newspapers. Each comment was annotated for …