Data-centric artificial intelligence: A survey

D Zha, ZP Bhat, KH Lai, F Yang, Z Jiang… - ACM Computing …, 2023 - dl.acm.org
Artificial Intelligence (AI) is making a profound impact in almost every domain. A vital enabler
of its great success is the availability of abundant and high-quality data for building machine …

Boundary-enhanced co-training for weakly supervised semantic segmentation

S Rong, B Tu, Z Wang, J Li - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
The existing weakly supervised semantic segmentation (WSSS) methods pay much
attention to generating accurate and complete class activation maps (CAMs) as pseudo …

Combating noisy labels with sample selection by mining high-discrepancy examples

X **a, B Han, Y Zhan, J Yu, M Gong… - Proceedings of the …, 2023 - openaccess.thecvf.com
The sample selection approach is popular in learning with noisy labels. The state-of-the-art
methods train two deep networks simultaneously for sample selection, which aims to employ …

Instance-dependent noisy label learning via graphical modelling

A Garg, C Nguyen, R Felix, TT Do… - Proceedings of the …, 2023 - openaccess.thecvf.com
Noisy labels are unavoidable yet troublesome in the ecosystem of deep learning because
models can easily overfit them. There are many types of label noise, such as symmetric …

To aggregate or not? learning with separate noisy labels

J Wei, Z Zhu, T Luo, E Amid, A Kumar… - Proceedings of the 29th …, 2023 - dl.acm.org
The rawly collected training data often comes with separate noisy labels collected from
multiple imperfect annotators (eg, via crowdsourcing). A typical way of using these separate …

Transferring annotator-and instance-dependent transition matrix for learning from crowds

S Li, X **a, J Deng, S Gey, T Liu - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
Learning from crowds describes that the annotations of training data are obtained with
crowd-sourcing services. Multiple annotators each complete their own small part of the …

Beyond confusion matrix: learning from multiple annotators with awareness of instance features

J Li, H Sun, J Li - Machine Learning, 2023 - Springer
Learning from multiple annotators aims to induce a high-quality classifier from training
instances, where each of them is associated with a set of observed labels provided by …

Mitigating memorization of noisy labels via regularization between representations

H Cheng, Z Zhu, X Sun, Y Liu - arxiv preprint arxiv:2110.09022, 2021 - arxiv.org
Designing robust loss functions is popular in learning with noisy labels while existing
designs did not explicitly consider the overfitting property of deep neural networks (DNNs) …

Graphcleaner: Detecting mislabelled samples in popular graph learning benchmarks

Y Li, M **ong, B Hooi - International Conference on Machine …, 2023 - proceedings.mlr.press
Label errors have been found to be prevalent in popular text, vision, and audio datasets,
which heavily influence the safe development and evaluation of machine learning …

FedFixer: Mitigating Heterogeneous Label Noise in Federated Learning

X Ji, Z Zhu, W **, O Gadyatskaya, Z Song… - Proceedings of the …, 2024 - ojs.aaai.org
Federated Learning (FL) heavily depends on label quality for its performance. However, the
label distribution among individual clients is always both noisy and heterogeneous. The …