Towards trustworthy and aligned machine learning: A data-centric survey with causality perspectives
The trustworthiness of machine learning has emerged as a critical topic in the field,
encompassing various applications and research areas such as robustness, security …
encompassing various applications and research areas such as robustness, security …
Stable bias: Analyzing societal representations in diffusion models
As machine learning-enabled Text-to-Image (TTI) systems are becoming increasingly
prevalent and seeing growing adoption as commercial services, characterizing the social …
prevalent and seeing growing adoption as commercial services, characterizing the social …
Is your toxicity my toxicity? exploring the impact of rater identity on toxicity annotation
N Goyal, ID Kivlichan, R Rosen… - Proceedings of the ACM …, 2022 - dl.acm.org
Machine learning models are commonly used to detect toxicity in online conversations.
These models are trained on datasets annotated by human raters. We explore how raters' …
These models are trained on datasets annotated by human raters. We explore how raters' …
A survey on multilingual large language models: Corpora, alignment, and bias
Y Xu, L Hu, J Zhao, Z Qiu, K XU, Y Ye, H Gu - arxiv preprint arxiv …, 2024 - arxiv.org
Based on the foundation of Large Language Models (LLMs), Multilingual LLMs (MLLMs)
have been developed to address the challenges faced in multilingual natural language …
have been developed to address the challenges faced in multilingual natural language …
Fairness in language models beyond English: Gaps and challenges
With language models becoming increasingly ubiquitous, it has become essential to
address their inequitable treatment of diverse demographic groups and factors. Most …
address their inequitable treatment of diverse demographic groups and factors. Most …
Understanding instance-level impact of fairness constraints
A variety of fairness constraints have been proposed in the literature to mitigate group-level
statistical bias. Their impacts have been largely evaluated for different groups of populations …
statistical bias. Their impacts have been largely evaluated for different groups of populations …
Vision-language models performing zero-shot tasks exhibit disparities between gender groups
We explore the extent to which zero-shot vision-language models exhibit gender bias for
different vision tasks. Vision models traditionally required task-specific labels for …
different vision tasks. Vision models traditionally required task-specific labels for …
Tibet: Identifying and evaluating biases in text-to-image generative models
Abstract Text-to-Image (TTI) generative models have shown great progress in the past few
years in terms of their ability to generate complex and high-quality imagery. At the same …
years in terms of their ability to generate complex and high-quality imagery. At the same …
Mitigating test-time bias for fair image retrieval
We address the challenge of generating fair and unbiased image retrieval results given
neutral textual queries (with no explicit gender or race connotations), while maintaining the …
neutral textual queries (with no explicit gender or race connotations), while maintaining the …
A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers
The rapid development of Large Language Models (LLMs) demonstrates remarkable
multilingual capabilities in natural language processing, attracting global attention in both …
multilingual capabilities in natural language processing, attracting global attention in both …