Pervasive AI for IoT applications: A survey on resource-efficient distributed artificial intelligence

E Baccour, N Mhaisen, AA Abdellatif… - … Surveys & Tutorials, 2022 - ieeexplore.ieee.org
Artificial intelligence (AI) has witnessed a substantial breakthrough in a variety of Internet of
Things (IoT) applications and services, spanning from recommendation systems and speech …

On-demand communication for asynchronous multi-agent bandits

YZJ Chen, L Yang, X Wang, X Liu… - International …, 2023 - proceedings.mlr.press
This paper studies a cooperative multi-agent multi-armed stochastic bandit problem where
agents operate asynchronously–agent pull times and rates are unknown, irregular, and …

Differentially private linear bandits with partial distributed feedback

F Li, X Zhou, B Ji - … Symposium on Modeling and Optimization in …, 2022 - ieeexplore.ieee.org
In this paper, we study the problem of global reward maximization with only partial
distributed feedback. This problem is motivated by several real-world applications (eg …

Reinforcement learning for user association and handover in mmwave-enabled networks

A Alizadeh, M Vu - IEEE Transactions on Wireless …, 2022 - ieeexplore.ieee.org
Using a multi-armed bandit technique, we propose centralized and semi-distributed online
algorithms for load balancing user association and handover in mmWave-enabled networks …

Multi-agent best arm identification with private communications

A Rio, M Barlier, I Colin… - … Conference on Machine …, 2023 - proceedings.mlr.press
We address multi-agent best arm identification with privacy guarantees. In this setting,
agents collaborate by communicating to find the optimal arm. To avoid leaking sensitive data …

Decentralized randomly distributed multi-agent multi-armed bandit with heterogeneous rewards

M Xu, D Klabjan - Advances in Neural Information …, 2024 - proceedings.neurips.cc
We study a decentralized multi-agent multi-armed bandit problem in which multiple clients
are connected by time dependent random graphs provided by an environment. The reward …

Collaborative linear bandits with adversarial agents: Near-optimal regret bounds

A Mitra, A Adibi, GJ Pappas… - Advances in neural …, 2022 - proceedings.neurips.cc
We consider a linear stochastic bandit problem involving $ M $ agents that can collaborate
via a central server to minimize regret. A fraction $\alpha $ of these agents are adversarial …