Directions in abusive language training data, a systematic review: Garbage in, garbage out

B Vidgen, L Derczynski - Plos one, 2020 - journals.plos.org
Data-driven and machine learning based approaches for detecting, categorising and
measuring abusive content such as hate speech and harassment have gained traction due …

Mitigating bias in algorithmic systems—a fish-eye view

K Orphanou, J Otterbacher, S Kleanthous… - ACM Computing …, 2022 - dl.acm.org
Mitigating bias in algorithmic systems is a critical issue drawing attention across
communities within the information and computer sciences. Given the complexity of the …

Disproportionate removals and differing content moderation experiences for conservative, transgender, and black social media users: Marginalization and moderation …

OL Haimson, D Delmonaco, P Nie… - Proceedings of the ACM on …, 2021 - dl.acm.org
Social media sites use content moderation to attempt to cultivate safe spaces with accurate
information for their users. However, content moderation decisions may not be applied …

Quantifying and alleviating political bias in language models

R Liu, C Jia, J Wei, G Xu, S Vosoughi - Artificial Intelligence, 2022 - Elsevier
Current large-scale language models can be politically biased as a result of the data they
are trained on, potentially causing serious problems when they are deployed in real-world …

Content moderation folk theories and perceptions of platform spirit among marginalized social media users

S Mayworm, MA DeVito, D Delmonaco… - ACM Transactions on …, 2024 - dl.acm.org
Social media users create folk theories to help explain how elements of social media
operate. Marginalized social media users face disproportionate content moderation and …

“Is COVID-19 a hoax?”: auditing the quality of COVID-19 conspiracy-related information and misinformation in Google search results in four languages

S Dabran-Zivan, A Baram-Tsabari, R Shapira… - Internet …, 2023 - emerald.com
Purpose Accurate information is the basis for well-informed decision-making, which is
particularly challenging in the dynamic reality of a pandemic. Search engines are a major …

[HTML][HTML] Empowering Indonesian internet users: An approach to counter online toxicity and enhance digital well-being

A Alamsyah, Y Sagama - Intelligent Systems with Applications, 2024 - Elsevier
The proliferation of online toxicity, characterized by offensive and disrespectful language,
has been a pervasive issue in Indonesia's digital environment, impacting users' mental …

Explainable AI to Mitigate the Lack of Transparency and Legitimacy in Internet Moderation

TP Ferraz, CHD Duarte, MF Ribeiro… - Estudos …, 2024 - SciELO Brasil
ISTORICALLY, societies have always established norms of coexistence that impose limits
on individual rights. This was the case with the fundamental right to freedom of expression …

Setting the record straighter on shadow banning

E Le Merrer, B Morgan, G Trédan - IEEE INFOCOM 2021-IEEE …, 2021 - ieeexplore.ieee.org
Shadow banning consists for an online social net-work in limiting the visibility of some of its
users, without them being aware of it. Twitter declares that it does not use such a practice …

Fine-grained classification of political bias in German news: A data set and initial experiments

D Aksenov, P Bourgonje, K Zaczynska… - Proceedings of the …, 2021 - aclanthology.org
We present a data set consisting of German news articles labeled for political bias on a five-
point scale in a semi-supervised way. While earlier work on hyperpartisan news detection …