[HTML][HTML] A survey on dataset quality in machine learning

Y Gong, G Liu, Y Xue, R Li, L Meng - Information and Software Technology, 2023 - Elsevier
With the rise of big data, the quality of datasets has become a crucial factor affecting the
performance of machine learning models. High-quality datasets are essential for the …

Representation bias in data: A survey on identification and resolution techniques

N Shahbazi, Y Lin, A Asudeh, HV Jagadish - ACM Computing Surveys, 2023 - dl.acm.org
Data-driven algorithms are only as good as the data they work with, while datasets,
especially social data, often fail to represent minorities adequately. Representation Bias in …

Towards unbounded machine unlearning

M Kurmanji, P Triantafillou, J Hayes… - Advances in neural …, 2023 - proceedings.neurips.cc
Deep machine unlearning is the problem of'removing'from a trained neural network a subset
of its training set. This problem is very timely and has many applications, including the key …

Policy advice and best practices on bias and fairness in AI

JM Alvarez, AB Colmenarejo, A Elobaid… - Ethics and Information …, 2024 - Springer
The literature addressing bias and fairness in AI models (fair-AI) is growing at a fast pace,
making it difficult for novel researchers and practitioners to have a bird's-eye view picture of …

REVISE: A tool for measuring and mitigating bias in visual datasets

A Wang, A Liu, R Zhang, A Kleiman, L Kim… - International Journal of …, 2022 - Springer
Abstract Machine learning models are known to perpetuate and even amplify the biases
present in the data. However, these data biases frequently do not become apparent until …

Would Deep Generative Models Amplify Bias in Future Models?

T Chen, Y Hirota, M Otani, N Garcia… - Proceedings of the …, 2024 - openaccess.thecvf.com
We investigate the impact of deep generative models on potential social biases in upcoming
computer vision models. As the internet witnesses an increasing influx of AI-generated …

[HTML][HTML] Computational pathology: a survey review and the way forward

MS Hosseini, BE Bejnordi, VQH Trinh, L Chan… - Journal of Pathology …, 2024 - Elsevier
Abstract Computational Pathology (CPath) is an interdisciplinary science that augments
developments of computational approaches to analyze and model medical histopathology …

Algorithmic fairness datasets: the story so far

A Fabris, S Messina, G Silvello, GA Susto - Data Mining and Knowledge …, 2022 - Springer
Data-driven algorithms are studied and deployed in diverse domains to support critical
decisions, directly impacting people's well-being. As a result, a growing community of …

Socialcounterfactuals: Probing and mitigating intersectional social biases in vision-language models with counterfactual examples

P Howard, A Madasu, T Le… - Proceedings of the …, 2024 - openaccess.thecvf.com
While vision-language models (VLMs) have achieved remarkable performance
improvements recently there is growing evidence that these models also posses harmful …

Exploring the impact of dataset bias on dataset distillation

Y Lu, J Gu, X Chen, S Vahidian… - Proceedings of the …, 2024 - openaccess.thecvf.com
Dataset Distillation (DD) is a promising technique to synthesize a smaller dataset that
preserves essential information from the original dataset. This synthetic dataset can serve as …