The psychological well-being of content moderators: the emotional labor of commercial moderation and avenues for improving support

M Steiger, TJ Bharucha, S Venkatagiri… - Proceedings of the …, 2021 - dl.acm.org
An estimated 100,000 people work today as commercial content moderators. These
moderators are often exposed to disturbing content, which can lead to lasting psychological …

Llama Guard: LLM-based input-output safeguard for human-AI conversations

H Inan, K Upasani, J Chi, R Rungta, K Iyer… - arXiv preprint arXiv …, 2023 - arxiv.org
We introduce Llama Guard, an LLM-based input-output safeguard model geared towards
Human-AI conversation use cases. Our model incorporates a safety risk taxonomy, a …

Children's safety on YouTube: A systematic review

SI Alqahtani, WMS Yafooz, A Alsaeedi, L Syed… - Applied Sciences, 2023 - mdpi.com
Background: With digital transformation and growing social media usage, kids spend
considerable time on the web, especially watching videos on YouTube. YouTube is a source …

CF-GNNExplainer: Counterfactual explanations for graph neural networks

A Lucic, MA Ter Hoeve, G Tolomei… - International …, 2022 - proceedings.mlr.press
Given the increasing promise of graph neural networks (GNNs) in real-world applications,
several methods have been developed for explaining their predictions. Existing methods for …

Building human values into recommender systems: An interdisciplinary synthesis

J Stray, A Halevy, P Assar, D Hadfield-Menell… - ACM Transactions on …, 2024 - dl.acm.org
Recommender systems are the algorithms which select, filter, and personalize content
across many of the world's largest platforms and apps. As such, their positive and negative …

Detecting and understanding harmful memes: A survey

S Sharma, F Alam, MS Akhtar, D Dimitrov… - arXiv preprint arXiv …, 2022 - arxiv.org
The automatic identification of harmful content online is of major concern for social media
platforms, policymakers, and society. Researchers have studied textual, visual, and audio …

SoK: Content moderation for end-to-end encryption

S Scheffler, J Mayer - arXiv preprint arXiv:2303.03979, 2023 - arxiv.org
Popular messaging applications now enable end-to-end encryption (E2EE) by default, and
E2EE data storage is becoming common. These important advances for security and privacy …

Multimodal hate speech detection in Greek social media

K Perifanos, D Goutsos - Multimodal Technologies and Interaction, 2021 - mdpi.com
Hateful and abusive speech presents a major challenge for all online social media
platforms. Recent advances in Natural Language Processing and Natural Language …

Analyzing the use of large language models for content moderation with ChatGPT examples

M Franco, O Gaggi, CE Palazzi - … of the 3rd International Workshop on …, 2023 - dl.acm.org
Content moderation systems are crucial in Online Social Networks (OSNs). Indeed, their role
is to keep platforms and their users safe from malicious activities. However, there is an …

Designing recommender systems to depolarize

J Stray - arXiv preprint arXiv:2107.04953, 2021 - arxiv.org
Polarization is implicated in the erosion of democracy and the progression to violence,
which makes the polarization properties of large algorithmic content selection systems …