Is one annotation enough?-a data-centric image classification benchmark for noisy and ambiguous label estimation

L Schmarje, V Grossmann, C Zelenka… - Advances in …, 2022 - proceedings.neurips.cc
High-quality data is necessary for modern machine learning. However, the acquisition of
such data is difficult due to noisy and ambiguous annotations of humans. The aggregation of …

Opportunities and Challenges in Data-Centric AI

S Kumar, S Datta, V Singh, SK Singh, R Sharma - IEEE Access, 2024 - ieeexplore.ieee.org
Artificial intelligence (AI) systems are trained to solve complex problems and learn to
perform specific tasks by using large volumes of data, such as prediction, classification …

Potential impact of data-centric AI on society

S Kumar, R Sharma, V Singh, S Tiwari… - IEEE Technology …, 2023 - ieeexplore.ieee.org
Data-centric artificial intelligence (AI)(DCAI) has the potential to bring significant benefits to
society; however, it also poses significant challenges and potential risks. It is crucial to …

Beyond hard labels: investigating data label distributions

V Grossmann, L Schmarje, R Koch - arxiv preprint arxiv:2207.06224, 2022 - arxiv.org
High-quality data is a key aspect of modern machine learning. However, labels generated
by humans suffer from issues like label noise and class ambiguities. We raise the question …

A data-centric approach for improving ambiguous labels with combined semi-supervised classification and clustering

L Schmarje, M Santarossa, SM Schröder… - … on Computer Vision, 2022 - Springer
Consistently high data quality is essential for the development of novel loss functions and
architectures in the field of deep learning. The existence of such data and labels is usually …

Annotating Ambiguous Images: General Annotation Strategy for High-Quality Data with Real-World Biomedical Validation

L Schmarje, V Grossmann, C Zelenka… - arxiv preprint arxiv …, 2023 - openreview.net
In the field of image classification, existing methods often struggle with biased or ambiguous
data, a prevalent issue in real-world scenarios. Current strategies, including semi …

[PDF][PDF] Impact of Data Quality on Question Answering System Performances.

R Karra, A Lasfar - Intelligent Automation & Soft Computing, 2023 - academia.edu
In contrast with the research of new models, little attention has been paid to the impact of low
or high-quality data feeding a dialogue system. The present paper makes the first attempt to …

Addressing the Challenge of Ambiguous Data in Deep Learning: A Strategy for Creating High-quality Image Annotations with Human Reliability and Judgement …

L Schmarje - 2024 - macau.uni-kiel.de
In machine learning, the availability of high-quality labeled data is essential for training
accurate models. However, humans often disagree among themselves or over time when …

[PDF][PDF] A data-centric approach for improving ambiguous labels with combined semi-supervised classification and clustering

R Koch - ecva.net
Consistently high data quality is essential for the development of novel loss functions and
architectures in the field of deep learning. The existence of such data and labels is usually …

[PDF][PDF] Habilitando Anotações de Dados Autônomos: Uma Abordagem de Aprendizado por Reforço com Humano no Loop

LC da Cruz - 2022 - maxwell.vrac.puc-rio.br
Leonardo Cardia da Cruz Habilitando Anotações de Dados Autônomos: Uma Abordagem de
Aprendizado por Reforço com Humano no Loo Page 1 Leonardo Cardia da Cruz Habilitando …