Towards trustworthy and aligned machine learning: A data-centric survey with causality perspectives

H Liu, M Chaudhary, H Wang - arxiv preprint arxiv:2307.16851, 2023 - arxiv.org
The trustworthiness of machine learning has emerged as a critical topic in the field,
encompassing various applications and research areas such as robustness, security …

Stable bias: Analyzing societal representations in diffusion models

AS Luccioni, C Akiki, M Mitchell, Y Jernite - arxiv preprint arxiv …, 2023 - arxiv.org
As machine learning-enabled Text-to-Image (TTI) systems are becoming increasingly
prevalent and seeing growing adoption as commercial services, characterizing the social …

Is your toxicity my toxicity? exploring the impact of rater identity on toxicity annotation

N Goyal, ID Kivlichan, R Rosen… - Proceedings of the ACM …, 2022 - dl.acm.org
Machine learning models are commonly used to detect toxicity in online conversations.
These models are trained on datasets annotated by human raters. We explore how raters' …

A survey on multilingual large language models: Corpora, alignment, and bias

Y Xu, L Hu, J Zhao, Z Qiu, K XU, Y Ye, H Gu - arxiv preprint arxiv …, 2024 - arxiv.org
Based on the foundation of Large Language Models (LLMs), Multilingual LLMs (MLLMs)
have been developed to address the challenges faced in multilingual natural language …

Fairness in language models beyond English: Gaps and challenges

K Ramesh, S Sitaram, M Choudhury - arxiv preprint arxiv:2302.12578, 2023 - arxiv.org
With language models becoming increasingly ubiquitous, it has become essential to
address their inequitable treatment of diverse demographic groups and factors. Most …

Understanding instance-level impact of fairness constraints

J Wang, XE Wang, Y Liu - International Conference on …, 2022 - proceedings.mlr.press
A variety of fairness constraints have been proposed in the literature to mitigate group-level
statistical bias. Their impacts have been largely evaluated for different groups of populations …

Vision-language models performing zero-shot tasks exhibit disparities between gender groups

M Hall, L Gustafson, A Adcock… - Proceedings of the …, 2023 - openaccess.thecvf.com
We explore the extent to which zero-shot vision-language models exhibit gender bias for
different vision tasks. Vision models traditionally required task-specific labels for …

Tibet: Identifying and evaluating biases in text-to-image generative models

A Chinchure, P Shukla, G Bhatt, K Salij… - … on Computer Vision, 2024 - Springer
Abstract Text-to-Image (TTI) generative models have shown great progress in the past few
years in terms of their ability to generate complex and high-quality imagery. At the same …

Mitigating test-time bias for fair image retrieval

F Kong, S Yuan, W Hao… - Advances in Neural …, 2024 - proceedings.neurips.cc
We address the challenge of generating fair and unbiased image retrieval results given
neutral textual queries (with no explicit gender or race connotations), while maintaining the …

A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers

K Huang, F Mo, H Li, Y Li, Y Zhang, W Yi, Y Mao… - arxiv preprint arxiv …, 2024 - arxiv.org
The rapid development of Large Language Models (LLMs) demonstrates remarkable
multilingual capabilities in natural language processing, attracting global attention in both …