Representation bias in data: A survey on identification and resolution techniques

N Shahbazi, Y Lin, A Asudeh, HV Jagadish - ACM Computing Surveys, 2023 - dl.acm.org
Data-driven algorithms are only as good as the data they work with, while datasets,
especially social data, often fail to represent minorities adequately. Representation Bias in …

Mitigating bias in algorithmic systems—a fish-eye view

K Orphanou, J Otterbacher, S Kleanthous… - ACM Computing …, 2022 - dl.acm.org
Mitigating bias in algorithmic systems is a critical issue drawing attention across
communities within the information and computer sciences. Given the complexity of the …

SliceTeller: A data slice-driven approach for machine learning model validation

X Zhang, JP Ono, H Song, L Gou… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Real-world machine learning applications need to be thoroughly evaluated to meet critical
product requirements for model release, to ensure fairness for different groups or …

Context-Aware Testing: A New Paradigm for Model Testing with Large Language Models

P Rauba, N Seedat, M Ruiz Luyten… - Advances in Neural …, 2025 - proceedings.neurips.cc
The predominant de facto paradigm of testing ML models relies on either using only held-out
data to compute aggregate evaluation metrics or by assessing the performance on different …

A unified interactive model evaluation for classification, object detection, and instance segmentation in computer vision

C Chen, Y Guo, F Tian, S Liu, W Yang… - … on Visualization and …, 2023 - ieeexplore.ieee.org
Existing model evaluation tools mainly focus on evaluating classification models, leaving a
gap in evaluating more complex models, such as object detection. In this paper, we develop …

Fairness-aware range queries for selecting unbiased data

S Shetiya, IP Swift, A Asudeh… - 2022 IEEE 38th …, 2022 - ieeexplore.ieee.org
We are being constantly judged by automated decision systems that have been widely
criticised for being discriminatory and unfair. Since an algorithm is only as good as the data …

[PDF][PDF] Benchmarking post-hoc interpretability approaches for transformer-based misogyny detection

G Attanasio, D Nozza, E Pastor… - Proceedings of NLP …, 2022 - iris.unibocconi.it
Warning: This paper contains examples of language that some people may find offensive.
Transformer-based Natural Language Processing models have become the standard for …

Prioritizing data acquisition for end-to-end speech model improvement

A Koudounas, E Pastor, G Attanasio… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
As speech processing moves toward more data-hungry models, data selection and
acquisition become crucial to building better systems. Recent efforts have championed …

[PDF][PDF] A survey on techniques for identifying and resolving representation bias in data

N Shahbazi, Y Lin, A Asudeh… - CoRR, abs …, 2022 - researchgate.net
Data-driven decision-making shapes every corner of human life today, from autonomous
vehicles to healthcare, and even in predictive policing and criminal sentencing. A critical …

Boosting court judgment prediction and explanation using legal entities

I Benedetto, A Koudounas, L Vaiani, E Pastor… - Artificial Intelligence and …, 2024 - Springer
The automatic prediction of court case judgments using Deep Learning and Natural
Language Processing is challenged by the variety of norms and regulations, the inherent …