Linear bandits with limited adaptivity and learning distributional optimal design

Y Ruan, J Yang, Y Zhou - Proceedings of the 53rd Annual ACM SIGACT …, 2021 - dl.acm.org
Motivated by practical needs such as large-scale learning, we study the impact of adaptivity
constraints to linear contextual bandits, a central problem in online learning and decision …

Near-optimal collaborative learning in bandits

C Réda, S Vakili, E Kaufmann - Advances in Neural …, 2022 - proceedings.neurips.cc
This paper introduces a general multi-agent bandit model in which each agent is facing a
finite set of arms and may communicate with other agents through a central controller in …

Communication efficient distributed learning for kernelized contextual bandits

C Li, H Wang, M Wang, H Wang - Advances in Neural …, 2022 - proceedings.neurips.cc
We tackle the communication efficiency challenge of learning kernelized contextual bandits
in a distributed setting. Despite the recent advances in communication-efficient distributed …

Tight regret bounds for single-pass streaming multi-armed bandits

C Wang - International Conference on Machine Learning, 2023 - proceedings.mlr.press
Regret minimization in streaming multi-armed bandits (MABs) has been studied extensively,
and recent work has shown that algorithms with $ o (K) $ memory have to incur $\Omega …

The sample complexity of multi-distribution learning

B Peng - The Thirty Seventh Annual Conference on Learning …, 2024 - proceedings.mlr.press
Multi-distribution learning generalizes the classic PAC learning to handle data coming from
multiple distributions. Given a set of $ k $ data distributions and a hypothesis class of VC …

Exploiting heterogeneity in robust federated best-arm identification

A Mitra, H Hassani, G Pappas - arxiv preprint arxiv:2109.05700, 2021 - arxiv.org
We study a federated variant of the best-arm identification problem in stochastic multi-armed
bandits: a set of clients, each of whom can sample only a subset of the arms, collaborate via …

Incentivized communication for federated bandits

Z Wei, C Li, H Xu, H Wang - Advances in Neural …, 2023 - proceedings.neurips.cc
Most existing works on federated bandits take it for granted that all clients are altruistic about
sharing their data with the server for the collective good whenever needed. Despite their …

Optimal streaming algorithms for multi-armed bandits

T **, K Huang, J Tang, X **ao - International Conference on …, 2021 - proceedings.mlr.press
This paper studies two variants of the best arm identification (BAI) problem under the
streaming model, where we have a stream of n arms with reward distributions supported on …

Distributed linear bandits under communication constraints

S Salgia, Q Zhao - International Conference on Machine …, 2023 - proceedings.mlr.press
We consider distributed linear bandits where $ M $ agents learn collaboratively to minimize
the overall cumulative regret incurred by all agents. Information exchange is facilitated by a …

Collaborative top distribution identifications with limited interaction

N Karpov, Q Zhang, Y Zhou - 2020 IEEE 61st Annual …, 2020 - ieeexplore.ieee.org
We consider the following problem in this paper: given a set of n distributions, find the top-m
ones with the largest means. This problem is also called top-m arm identifications in the …