Beyond images: Label noise transition matrix estimation for tasks with lower-quality features

Z Zhu, J Wang, Y Liu - International Conference on Machine …, 2022 - proceedings.mlr.press
The label noise transition matrix, denoting the transition probabilities from clean labels to
noisy labels, is crucial for designing statistically robust solutions. Existing estimators for …

Mitigating memorization of noisy labels by clip** the model prediction

H Wei, H Zhuang, R **e, L Feng… - … on machine learning, 2023 - proceedings.mlr.press
In the presence of noisy labels, designing robust loss functions is critical for securing the
generalization performance of deep neural networks. Cross Entropy (CE) loss has been …

Unmasking and improving data credibility: A study with datasets for training harmless language models

Z Zhu, J Wang, H Cheng, Y Liu - arxiv preprint arxiv:2311.11202, 2023 - arxiv.org
Language models have shown promise in various tasks but can be affected by undesired
data during training, fine-tuning, or alignment. For example, if some unsafe conversations …

Admoe: Anomaly detection with mixture-of-experts from noisy labels

Y Zhao, G Zheng, S Mukherjee, R McCann… - Proceedings of the …, 2023 - ojs.aaai.org
Existing works on anomaly detection (AD) rely on clean labels from human annotators that
are expensive to acquire in practice. In this work, we propose a method to leverage …

Coupled confusion correction: Learning from crowds with sparse annotations

H Zhang, S Li, D Zeng, C Yan, S Ge - Proceedings of the AAAI …, 2024 - ojs.aaai.org
As the size of the datasets getting larger, accurately annotating such datasets is becoming
more impractical due to the expensiveness on both time and economy. Therefore, crowd …

To aggregate or not? learning with separate noisy labels

J Wei, Z Zhu, T Luo, E Amid, A Kumar… - Proceedings of the 29th …, 2023 - dl.acm.org
The rawly collected training data often comes with separate noisy labels collected from
multiple imperfect annotators (eg, via crowdsourcing). A typical way of using these separate …

Weak proxies are sufficient and preferable for fairness with missing sensitive attributes

Z Zhu, Y Yao, J Sun, H Li, Y Liu - … Conference on Machine …, 2023 - proceedings.mlr.press
Evaluating fairness can be challenging in practice because the sensitive attributes of data
are often inaccessible due to privacy constraints. The go-to approach that the industry …

Deep learning from crowdsourced labels: Coupled cross-entropy minimization, identifiability, and regularization

S Ibrahim, T Nguyen, X Fu - arxiv preprint arxiv:2306.03288, 2023 - arxiv.org
Using noisy crowdsourced labels from multiple annotators, a deep learning-based end-to-
end (E2E) system aims to learn the label correction mechanism and the neural classifier …

Transferring annotator-and instance-dependent transition matrix for learning from crowds

S Li, X **a, J Deng, S Gey, T Liu - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
Learning from crowds describes that the annotations of training data are obtained with
crowd-sourcing services. Multiple annotators each complete their own small part of the …

Learning from crowds with annotation reliability

Z Cao, E Chen, Y Huang, S Shen… - Proceedings of the 46th …, 2023 - dl.acm.org
Crowdsourcing provides a practical approach for obtaining annotated data to train
supervised learning models. However, since the crowd annotators may have different …