Data pricing in machine learning pipelines

Z Cong, X Luo, J Pei, F Zhu, Y Zhang - Knowledge and Information …, 2022 - Springer
Abstract Machine learning is disruptive. At the same time, machine learning can only
succeed by collaboration among many parties in multiple steps naturally as pipelines in an …

Online class-incremental continual learning with adversarial shapley value

D Shim, Z Mai, J Jeong, S Sanner, H Kim… - Proceedings of the AAAI …, 2021 - ojs.aaai.org
As image-based deep learning becomes pervasive on every device, from cell phones to
smart watches, there is a growing need to develop methods that continually learn from data …

A principled approach to data valuation for federated learning

T Wang, J Rausch, C Zhang, R Jia, D Song - Federated Learning: Privacy …, 2020 - Springer
Federated learning (FL) is a popular technique to train machine learning (ML) models on
decentralized data sources. In order to sustain long-term participation of data owners, it is …

Data banzhaf: A robust data valuation framework for machine learning

JT Wang, R Jia - International Conference on Artificial …, 2023 - proceedings.mlr.press
Data valuation has wide use cases in machine learning, including improving data quality
and creating economic incentives for data sharing. This paper studies the robustness of data …

[HTML][HTML] Data-driven learning for data rights, data pricing, and privacy computing

J Xu, N Hong, Z Xu, Z Zhao, C Wu, K Kuang, J Wang… - Engineering, 2023 - Elsevier
In recent years, data has become one of the most important resources in the digital
economy. Unlike traditional resources, the digital nature of data makes it difficult to value …

Davinz: Data valuation using deep neural networks at initialization

Z Wu, Y Shu, BKH Low - International Conference on …, 2022 - proceedings.mlr.press
Recent years have witnessed a surge of interest in develo** trustworthy methods to
evaluate the value of data in many real-world applications (eg, collaborative machine …

Beta shapley: a unified and noise-reduced data valuation framework for machine learning

Y Kwon, J Zou - arxiv preprint arxiv:2110.14049, 2021 - arxiv.org
Data Shapley has recently been proposed as a principled framework to quantify the
contribution of individual datum in machine learning. It can effectively identify helpful or …

Profit allocation for federated learning

T Song, Y Tong, S Wei - … Conference on Big Data (Big Data), 2019 - ieeexplore.ieee.org
Due to stricter data management regulations such as General Data Protection Regulation
(GDPR), traditional production mode of machine learning services is shifting to federated …

A survey on data pricing: from economics to data science

J Pei - IEEE Transactions on knowledge and Data …, 2020 - ieeexplore.ieee.org
Data are invaluable. How can we assess the value of data objectively, systematically and
quantitatively? Pricing data, or information goods in general, has been studied and practiced …

Training data influence analysis and estimation: A survey

Z Hammoudeh, D Lowd - Machine Learning, 2024 - Springer
Good models require good training data. For overparameterized deep models, the causal
relationship between training data and model predictions is increasingly opaque and poorly …