- Academic Search

SM Park, K Georgiev, A Ilyas, G Leclerc… - arxiv preprint arxiv …, 2023 - arxiv.org

The goal of data attribution is to trace model predictions back to training data. Despite a long
line of work towards this goal, existing approaches to data attribution tend to force users to …

Lagre Referanse Sitert av 143 Beslektede artikler Alle 6 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Data banzhaf: A robust data valuation framework for machine learning

JT Wang, R Jia - International Conference on Artificial …, 2023 - proceedings.mlr.press

Data valuation has wide use cases in machine learning, including improving data quality
and creating economic incentives for data sharing. This paper studies the robustness of data …

Lagre Referanse Sitert av 108 Beslektede artikler Alle 5 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] springer.com

Training data influence analysis and estimation: A survey

Z Hammoudeh, D Lowd - Machine Learning, 2024 - Springer

Good models require good training data. For overparameterized deep models, the causal
relationship between training data and model predictions is increasingly opaque and poorly …

Lagre Referanse Sitert av 89 Beslektede artikler Alle 5 versjoner

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Data-oob: Out-of-bag estimate as a simple and efficient data value

Y Kwon, J Zou - International conference on machine …, 2023 - proceedings.mlr.press

Data valuation is a powerful framework for providing statistical insights into which data are
beneficial or detrimental to model training. Many Shapley-based data valuation methods …

Lagre Referanse Sitert av 38 Beslektede artikler Alle 7 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Opendataval: a unified benchmark for data valuation

K Jiang, W Liang, JY Zou… - Advances in Neural …, 2023 - proceedings.neurips.cc

Assessing the quality and impact of individual data points is critical for improving model
performance and mitigating undesirable biases within the training dataset. Several data …

Lagre Referanse Sitert av 35 Beslektede artikler Alle 5 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Unleashing the power of data tsunami: A comprehensive survey on data assessment and selection for instruction tuning of language models

Y Qin, Y Yang, P Guo, G Li, H Shao, Y Shi, Z Xu… - arxiv preprint arxiv …, 2024 - arxiv.org

Instruction tuning plays a critical role in aligning large language models (LLMs) with human
preference. Despite the vast amount of open instruction datasets, naively training a LLM on …

Lagre Referanse Sitert av 7 Beslektede artikler Alle 3 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

A privacy-friendly approach to data valuation

JT Wang, Y Zhu, YX Wang, R Jia… - Advances in Neural …, 2023 - proceedings.neurips.cc

Data valuation, a growing field that aims at quantifying the usefulness of individual data
sources for training machine learning (ML) models, faces notable yet often overlooked …

Lagre Referanse Sitert av 23 Beslektede artikler Alle 9 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Rethinking backdoor attacks

A Khaddaj, G Leclerc, A Makelov… - International …, 2023 - proceedings.mlr.press

In a backdoor attack, an adversary inserts maliciously constructed backdoor examples into a
training set to make the resulting model vulnerable to manipulation. Defending against such …

Lagre Referanse Sitert av 19 Beslektede artikler Alle 8 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Open problems in technical ai governance

A Reuel, B Bucknall, S Casper, T Fist, L Soder… - arxiv preprint arxiv …, 2024 - arxiv.org

AI progress is creating a growing range of risks and opportunities, but it is often unclear how
they should be navigated. In many cases, the barriers and uncertainties faced are at least …

Lagre Referanse Sitert av 27 Beslektede artikler Alle 4 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Intriguing properties of data attribution on diffusion models

X Zheng, T Pang, C Du, J Jiang, M Lin - arxiv preprint arxiv:2311.00500, 2023 - arxiv.org

Data attribution seeks to trace model outputs back to training data. With the recent
development of diffusion models, data attribution has become a desired module to properly …

Lagre Referanse Sitert av 25 Beslektede artikler Alle 4 versjoner HTML-versjon

Opprett varsel

Referanse

Avansert søk

Lagret i Mitt bibliotek

Measuring the effect of training data on deep learning predictions via randomized experiments

Trak: Attributing model behavior at scale

Data banzhaf: A robust data valuation framework for machine learning

Training data influence analysis and estimation: A survey

Data-oob: Out-of-bag estimate as a simple and efficient data value

Opendataval: a unified benchmark for data valuation

Unleashing the power of data tsunami: A comprehensive survey on data assessment and selection for instruction tuning of language models

A privacy-friendly approach to data valuation

Rethinking backdoor attacks

Open problems in technical ai governance

Intriguing properties of data attribution on diffusion models