A comprehensive survey on deep active learning in medical image analysis

H Wang, Q **, S Li, S Liu, M Wang, Z Song - Medical Image Analysis, 2024 - Elsevier
Deep learning has achieved widespread success in medical image analysis, leading to an
increasing demand for large-scale expert-annotated medical image datasets. Yet, the high …

Trak: Attributing model behavior at scale

SM Park, K Georgiev, A Ilyas, G Leclerc… - arxiv preprint arxiv …, 2023 - arxiv.org
The goal of data attribution is to trace model predictions back to training data. Despite a long
line of work towards this goal, existing approaches to data attribution tend to force users to …

Training data influence analysis and estimation: A survey

Z Hammoudeh, D Lowd - Machine Learning, 2024 - Springer
Good models require good training data. For overparameterized deep models, the causal
relationship between training data and model predictions is increasingly opaque and poorly …

On the need for a language describing distribution shifts: Illustrations on tabular datasets

J Liu, T Wang, P Cui… - Advances in Neural …, 2023 - proceedings.neurips.cc
Different distribution shifts require different algorithmic and operational interventions.
Methodological research must be grounded by the specific shifts they address. Although …

Active finetuning: Exploiting annotation budget in the pretraining-finetuning paradigm

Y **e, H Lu, J Yan, X Yang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Given the large-scale data and the high annotation cost, pretraining-finetuning becomes a
popular paradigm in multiple computer vision tasks. Previous research has covered both the …

What is your data worth to gpt? llm-scale data valuation with influence functions

SK Choe, H Ahn, J Bae, K Zhao, M Kang… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs) are trained on a vast amount of human-written data, but data
providers often remain uncredited. In response to this issue, data valuation (or data …

Divide and adapt: Active domain adaptation via customized learning

D Huang, J Li, W Chen, J Huang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Active domain adaptation (ADA) aims to improve the model adaptation performance by
incorporating the active learning (AL) techniques to label a maximally-informative subset of …

Kecor: Kernel coding rate maximization for active 3d object detection

Y Luo, Z Chen, Z Fang, Z Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Achieving a reliable LiDAR-based object detector in autonomous driving is paramount, but
its success hinges on obtaining large amounts of precise 3D annotations. Active learning …

Intriguing properties of data attribution on diffusion models

X Zheng, T Pang, C Du, J Jiang, M Lin - arxiv preprint arxiv:2311.00500, 2023 - arxiv.org
Data attribution seeks to trace model outputs back to training data. With the recent
development of diffusion models, data attribution has become a desired module to properly …

Meta agent teaming active learning for pose estimation

J Gong, Z Fan, Q Ke, H Rahmani… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
The existing pose estimation approaches often require a large number of annotated images
to attain good estimation performance, which are laborious to acquire. To reduce the human …