[HTML][HTML] Gender bias in transformers: A comprehensive review of detection and mitigation strategies

P Nemani, YD Joel, P Vijay, FF Liza - Natural Language Processing …, 2024 - Elsevier
Gender bias in artificial intelligence (AI) has emerged as a pressing concern with profound
implications for individuals' lives. This paper presents a comprehensive survey that explores …

Open problems and fundamental limitations of reinforcement learning from human feedback

S Casper, X Davies, C Shi, TK Gilbert… - arxiv preprint arxiv …, 2023 - arxiv.org
Reinforcement learning from human feedback (RLHF) is a technique for training AI systems
to align with human goals. RLHF has emerged as the central method used to finetune state …

Exposing implicit biases and stereotypes in human and artificial intelligence: state of the art and challenges with a focus on gender

L Marinucci, C Mazzuca, A Gangemi - AI & SOCIETY, 2023 - Springer
Biases in cognition are ubiquitous. Social psychologists suggested biases and stereotypes
serve a multifarious set of cognitive goals, while at the same time stressing their potential …

Towards a science of human-ai decision making: a survey of empirical studies

V Lai, C Chen, QV Liao, A Smith-Renner… - arxiv preprint arxiv …, 2021 - arxiv.org
As AI systems demonstrate increasingly strong predictive performance, their adoption has
grown in numerous domains. However, in high-stakes domains such as criminal justice and …

How child welfare workers reduce racial disparities in algorithmic decisions

HF Cheng, L Stapleton, A Kawakami… - Proceedings of the …, 2022 - dl.acm.org
Machine learning tools have been deployed in various contexts to support human decision-
making, in the hope that human-algorithm collaboration can improve decision quality …

Disentangling fairness perceptions in algorithmic decision-making: the effects of explanations, human oversight, and contestability

M Yurrita, T Draws, A Balayn, D Murray-Rust… - Proceedings of the …, 2023 - dl.acm.org
Recent research claims that information cues and system attributes of algorithmic decision-
making processes affect decision subjects' fairness perceptions. However, little is still known …

Ground (less) truth: A causal framework for proxy labels in human-algorithm decision-making

L Guerdan, A Coston, ZS Wu, K Holstein - Proceedings of the 2023 ACM …, 2023 - dl.acm.org
A growing literature on human-AI decision-making investigates strategies for combining
human judgment with statistical models to improve decision-making. Research in this area …

A checklist to combat cognitive biases in crowdsourcing

T Draws, A Rieger, O Inel, U Gadiraju… - Proceedings of the AAAI …, 2021 - ojs.aaai.org
Recent research has demonstrated that cognitive biases such as the confirmation bias or the
anchoring effect can negatively affect the quality of crowdsourced data. In practice, however …

Managing bias and unfairness in data for decision support: a survey of machine learning and data engineering approaches to identify and mitigate bias and …

A Balayn, C Lofi, GJ Houben - The VLDB Journal, 2021 - Springer
The increasing use of data-driven decision support systems in industry and governments is
accompanied by the discovery of a plethora of bias and unfairness issues in the outputs of …

Investigations of performance and bias in human-AI teamwork in hiring

A Peng, B Nushi, E Kiciman, K Inkpen… - Proceedings of the AAAI …, 2022 - ojs.aaai.org
In AI-assisted decision-making, effective hybrid (human-AI) teamwork is not solely
dependent on AI performance alone, but also on its impact on human decision-making …