Harnessing artificial intelligence to combat online hate: Exploring the challenges and opportunities of large language models in hate speech detection

T Kumarage, A Bhattacharjee, J Garland - arxiv preprint arxiv:2403.08035, 2024 - arxiv.org
Large language models (LLMs) excel in many diverse applications beyond language
generation, eg, translation, summarization, and sentiment analysis. One intriguing …

Towards interpretable hate speech detection using large language model-extracted rationales

A Nirmal, A Bhattacharjee, P Sheth, H Liu - arxiv preprint arxiv …, 2024 - arxiv.org
Although social media platforms are a prominent arena for users to engage in interpersonal
discussions and express opinions, the facade and anonymity offered by social media may …

Interpretable hate speech detection via large language model-extracted rationales

A Nirmal - 2024 - search.proquest.com
Social media platforms have become widely used for open communication, yet their lack of
moderation has led to the proliferation of harmful content, including hate speech. Manual …

Causality guided disentanglement for cross-platform hate speech detection

P Sheth, R Moraffah, TS Kumarage… - Proceedings of the 17th …, 2024 - dl.acm.org
espite their value in promoting open discourse, social media plat-forms are often exploited to
spread harmful content. Current deep learning and natural language processing models …

Hate speech detection with generalizable target-aware fairness

T Chen, D Wang, X Liang, M Risius… - Proceedings of the 30th …, 2024 - dl.acm.org
To counter the side effect brought by the proliferation of social media platforms, hate speech
detection (HSD) plays a vital role in halting the dissemination of toxic online posts at an early …

Cross-Platform Hate Speech Detection with Weakly Supervised Causal Disentanglement

P Sheth, T Kumarage, R Moraffah… - … Conference on Big …, 2024 - ieeexplore.ieee.org
Content moderation on social media faces increasing challenges due to the rapid evolution
of hate speech. Identifying hate speech is challenging, especially as it constantly evolves to …

Learn from Failure: Causality-guided Contrastive Learning for Generalizable Implicit Hate Speech Detection

T Jiang - Proceedings of the 31st International Conference on …, 2025 - aclanthology.org
Implicit hate speech presents a significant challenge for automatic detection systems due to
its subtlety and ambiguity. Traditional models trained using empirical risk minimization …

Silence Speaks Volumes: Re-weighting Techniques for Under-Represented Users in Fake News Detection

M Karami, D Mosallanezhad, P Sheth… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Social media platforms provide a rich environment for analyzing user behavior. Recently,
deep learning-based methods have been a mainstream approach for social media analysis …

Between black and white: From cross-domain generalisation to user customisation in automated hate speech detection

W Yin - 2024 - qmro.qmul.ac.uk
The rapid growth of social media platforms highlighted an urgent need for the moderation of
hateful language, which not only harms the online community and its users, but also carries …

Large language models and causal analysis: zero-shot counterfactuals in hate speech perception

S Hernández Jiménez - 2024 - diposit.ub.edu
[en] Detecting hate speech is crucial for maintaining the integrity of social media platforms,
as it involves identifying content that denigrates individuals or groups based on their …