A survey on fairness in large language models

Y Li, M Du, R Song, X Wang, Y Wang - arxiv preprint arxiv:2308.10149, 2023 - arxiv.org
Large Language Models (LLMs) have shown powerful performance and development
prospects and are widely deployed in the real world. However, LLMs can capture social …

Towards trustworthy LLMs: a review on debiasing and dehallucinating in large language models

Z Lin, S Guan, W Zhang, H Zhang, Y Li… - Artificial Intelligence …, 2024 - Springer
Recently, large language models (LLMs) have attracted considerable attention due to their
remarkable capabilities. However, LLMs' generation of biased or hallucinatory content …

Fairness in large language models: A taxonomic survey

Z Chu, Z Wang, W Zhang - ACM SIGKDD explorations newsletter, 2024 - dl.acm.org
Large Language Models (LLMs) have demonstrated remarkable success across various
domains. However, despite their promising performance in numerous real-world …

[PDF][PDF] Bias and fairness in large language models: A survey

IO Gallegos, RA Rossi, J Barrow, MM Tanjim… - Computational …, 2024 - direct.mit.edu
Rapid advancements of large language models (LLMs) have enabled the processing,
understanding, and generation of human-like text, with increasing integration into systems …

Queer people are people first: Deconstructing sexual identity stereotypes in large language models

H Dhingra, P Jayashanker, S Moghe… - arxiv preprint arxiv …, 2023 - arxiv.org
Large Language Models (LLMs) are trained primarily on minimally processed web text,
which exhibits the same wide range of social biases held by the humans who created that …

Hi guys or hi folks? benchmarking gender-neutral machine translation with the GeNTE corpus

A Piergentili, B Savoldi, D Fucci, M Negri… - arxiv preprint arxiv …, 2023 - arxiv.org
Gender inequality is embedded in our communication practices and perpetuated in
translation technologies. This becomes particularly apparent when translating into …

Self-debiasing large language models: Zero-shot recognition and reduction of stereotypes

IO Gallegos, RA Rossi, J Barrow, MM Tanjim… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs) have shown remarkable advances in language generation
and understanding but are also prone to exhibiting harmful social biases. While recognition …

Exploiting biased models to de-bias text: A gender-fair rewriting model

C Amrhein, F Schottmann, R Sennrich… - arxiv preprint arxiv …, 2023 - arxiv.org
Natural language generation models reproduce and often amplify the biases present in their
training data. Previous research explored using sequence-to-sequence rewriting models to …

Under the morphosyntactic lens: A multifaceted evaluation of gender bias in speech translation

B Savoldi, M Gaido, L Bentivogli, M Negri… - arxiv preprint arxiv …, 2022 - arxiv.org
Gender bias is largely recognized as a problematic phenomenon affecting language
technologies, with recent studies underscoring that it might surface differently across …

Gender neutralization for an inclusive machine translation: from theoretical foundations to open challenges

A Piergentili, D Fucci, B Savoldi, L Bentivogli… - arxiv preprint arxiv …, 2023 - arxiv.org
Gender inclusivity in language technologies has become a prominent research topic. In this
study, we explore gender-neutral translation (GNT) as a form of gender inclusivity and a goal …