Trak: Attributing model behavior at scale

SM Park, K Georgiev, A Ilyas, G Leclerc… - arxiv preprint arxiv …, 2023 - arxiv.org
The goal of data attribution is to trace model predictions back to training data. Despite a long
line of work towards this goal, existing approaches to data attribution tend to force users to …

Data banzhaf: A robust data valuation framework for machine learning

JT Wang, R Jia - International Conference on Artificial …, 2023 - proceedings.mlr.press
Data valuation has wide use cases in machine learning, including improving data quality
and creating economic incentives for data sharing. This paper studies the robustness of data …

Training data influence analysis and estimation: A survey

Z Hammoudeh, D Lowd - Machine Learning, 2024 - Springer
Good models require good training data. For overparameterized deep models, the causal
relationship between training data and model predictions is increasingly opaque and poorly …

Data-oob: Out-of-bag estimate as a simple and efficient data value

Y Kwon, J Zou - International conference on machine …, 2023 - proceedings.mlr.press
Data valuation is a powerful framework for providing statistical insights into which data are
beneficial or detrimental to model training. Many Shapley-based data valuation methods …

Opendataval: a unified benchmark for data valuation

K Jiang, W Liang, JY Zou… - Advances in Neural …, 2023 - proceedings.neurips.cc
Assessing the quality and impact of individual data points is critical for improving model
performance and mitigating undesirable biases within the training dataset. Several data …

Unleashing the power of data tsunami: A comprehensive survey on data assessment and selection for instruction tuning of language models

Y Qin, Y Yang, P Guo, G Li, H Shao, Y Shi, Z Xu… - arxiv preprint arxiv …, 2024 - arxiv.org
Instruction tuning plays a critical role in aligning large language models (LLMs) with human
preference. Despite the vast amount of open instruction datasets, naively training a LLM on …

A privacy-friendly approach to data valuation

JT Wang, Y Zhu, YX Wang, R Jia… - Advances in Neural …, 2023 - proceedings.neurips.cc
Data valuation, a growing field that aims at quantifying the usefulness of individual data
sources for training machine learning (ML) models, faces notable yet often overlooked …

Rethinking backdoor attacks

A Khaddaj, G Leclerc, A Makelov… - International …, 2023 - proceedings.mlr.press
In a backdoor attack, an adversary inserts maliciously constructed backdoor examples into a
training set to make the resulting model vulnerable to manipulation. Defending against such …

Open problems in technical ai governance

A Reuel, B Bucknall, S Casper, T Fist, L Soder… - arxiv preprint arxiv …, 2024 - arxiv.org
AI progress is creating a growing range of risks and opportunities, but it is often unclear how
they should be navigated. In many cases, the barriers and uncertainties faced are at least …

Intriguing properties of data attribution on diffusion models

X Zheng, T Pang, C Du, J Jiang, M Lin - arxiv preprint arxiv:2311.00500, 2023 - arxiv.org
Data attribution seeks to trace model outputs back to training data. With the recent
development of diffusion models, data attribution has become a desired module to properly …