Advances, challenges and opportunities in creating data for trustworthy AI

W Liang, GA Tadesse, D Ho, L Fei-Fei… - Nature Machine …, 2022 - nature.com
As artificial intelligence (AI) transitions from research to deployment, creating the appropriate
datasets and data pipelines to develop and evaluate AI models is increasingly the biggest …

Incentive mechanisms for federated learning: From economic and game theoretic perspective

X Tu, K Zhu, NC Luong, D Niyato… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Federated learning (FL) becomes popular and has shown great potentials in training large-
scale machine learning (ML) models without exposing the owners' raw data. In FL, the data …

Foundation models and fair use

P Henderson, X Li, D Jurafsky, T Hashimoto… - Journal of Machine …, 2023 - jmlr.org
Existing foundation models are trained on copyrighted material. Deploying these models
can pose both legal and ethical risks when data creators fail to receive appropriate …

Studying large language model generalization with influence functions

R Grosse, J Bae, C Anil, N Elhage, A Tamkin… - arxiv preprint arxiv …, 2023 - arxiv.org
When trying to gain better visibility into a machine learning model in order to understand and
mitigate the associated risks, a potentially valuable source of evidence is: which training …

The shapley value in machine learning

B Rozemberczki, L Watson, P Bayer… - … Joint Conference on …, 2022 - research.ed.ac.uk
Over the last few years, the Shapley value, a solution concept from cooperative game theory,
has found numerous applications in machine learning. In this paper, we first discuss …

Trak: Attributing model behavior at scale

SM Park, K Georgiev, A Ilyas, G Leclerc… - arxiv preprint arxiv …, 2023 - arxiv.org
The goal of data attribution is to trace model predictions back to training data. Despite a long
line of work towards this goal, existing approaches to data attribution tend to force users to …

Understanding Dataset Difficulty with -Usable Information

K Ethayarajh, Y Choi… - … Conference on Machine …, 2022 - proceedings.mlr.press
Estimating the difficulty of a dataset typically involves comparing state-of-the-art models to
humans; the bigger the performance gap, the harder the dataset is said to be. However, this …

6G-enabled edge AI for metaverse: Challenges, methods, and future research directions

L Chang, Z Zhang, P Li, S **, W Guo… - Journal of …, 2022 - ieeexplore.ieee.org
Sixth generation (6G) enabled edge intelligence opens up a new era of Internet of
everything and makes it possible to interconnect people-devices-cloud anytime, anywhere …

Decentralized edge intelligence: A dynamic resource allocation framework for hierarchical federated learning

WYB Lim, JS Ng, Z **ong, J **, Y Zhang… - … on Parallel and …, 2021 - ieeexplore.ieee.org
To enable the large scale and efficient deployment of Artificial Intelligence (AI), the
confluence of AI and Edge Computing has given rise to Edge Intelligence, which leverages …

A survey of incentive mechanism design for federated learning

Y Zhan, J Zhang, Z Hong, L Wu, P Li… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Federated learning is promising in enabling large-scale machine learning by massive
clients without exposing their raw data. It can not only enable the clients to preserve the …