Unsupervised and semi-supervised bias benchmarking in face recognition

A Chouldechova, S Deng, Y Wang, W **a… - European conference on …, 2022 - Springer
Abstract We introduce Semi-supervised Performance Evaluation for Face Recognition (SPE-
FR). SPE-FR is a statistical method for evaluating the performance and algorithmic bias of …

AVeCQ: Anonymous Verifiable Crowdsourcing with Worker Qualities

V Koutsos, S Damle, D Papadopoulos… - … on Dependable and …, 2024 - ieeexplore.ieee.org
In crowdsourcing systems, requesters publish tasks, and interested workers provide
answers to get rewards. Worker anonymity motivates participation since it protects their …

Quality of sentiment analysis tools: The reasons of inconsistency

WM Kouadri, M Ouziri, S Benbernou… - Proceedings of the …, 2020 - dl.acm.org
In this paper, we present a comprehensive study that evaluates six state-of-the-art sentiment
analysis tools on five public datasets, based on the quality of predictive results in the …

Deploying data driven applications in smart buildings: Overcoming the initial onboarding barrier using machine learning

D Waterworth, S Sethuvenkatraman, QZ Sheng - Energy and Buildings, 2023 - Elsevier
The introduction of smart building technology promises many operational and productivity
benefits and enables smart grid integration. A significant barrier to deploying smart building …

Recovering top-two answers and confusion probability in multi-choice crowdsourcing

H Jeong, HW Chung - International Conference on Machine …, 2023 - proceedings.mlr.press
Crowdsourcing has emerged as an effective platform for labeling large amounts of data in a
cost-and time-efficient manner. Most previous work has focused on designing an efficient …

Record Fusion via Inference and Data Augmentation

A Heidari, G Michalopoulos, IF Ilyas… - ACM/JMS Journal of Data …, 2024 - dl.acm.org
We introduce a learning framework for the problem of unifying conflicting data in multiple
records referring to the same entity—we call this problem “record fusion.” Record fusion …

[HTML][HTML] Understanding Confusion: A Case Study of Training a Machine Model to Predict and Interpret Consensus From Volunteer Labels

R Sankar, K Mantha, C Nesmith… - Citizen …, 2024 - theoryandpractice …
Citizen science has become a valuable and reliable method for interpreting and processing
big datasets, and is vital in the era of ever-growing data volumes. However, there are …

AVeCQ: Anonymous Verifiable Crowdsourcing with Worker Qualities

S Damle, V Koutsos, D Papadopoulos… - arxiv preprint arxiv …, 2023 - arxiv.org
In crowdsourcing systems, requesters publish tasks, and interested workers provide
answers to get rewards. Worker anonymity motivates participation since it protects their …

In search of ambiguity: A three-stage workflow design to clarify annotation guidelines for crowd workers

VK Pradhan, M Schaekermann… - Frontiers in Artificial …, 2022 - frontiersin.org
We propose a novel three-stage FIND-RESOLVE-LABEL workflow for crowdsourced
annotation to reduce ambiguity in task instructions and, thus, improve annotation quality …

Exploring the Learning Difficulty of Data: Theory and Measure

W Zhu, O Wu, F Su, Y Deng - ACM Transactions on Knowledge …, 2024 - dl.acm.org
''Easy/hard sample” is a popular parlance in machine learning. Learning difficulty of samples
refers to how easy/hard a sample is during a learning procedure. An increasing need of …