Linear bandits with limited adaptivity and learning distributional optimal design
Motivated by practical needs such as large-scale learning, we study the impact of adaptivity
constraints to linear contextual bandits, a central problem in online learning and decision …
constraints to linear contextual bandits, a central problem in online learning and decision …
Near-optimal collaborative learning in bandits
This paper introduces a general multi-agent bandit model in which each agent is facing a
finite set of arms and may communicate with other agents through a central controller in …
finite set of arms and may communicate with other agents through a central controller in …
Communication efficient distributed learning for kernelized contextual bandits
We tackle the communication efficiency challenge of learning kernelized contextual bandits
in a distributed setting. Despite the recent advances in communication-efficient distributed …
in a distributed setting. Despite the recent advances in communication-efficient distributed …
Tight regret bounds for single-pass streaming multi-armed bandits
C Wang - International Conference on Machine Learning, 2023 - proceedings.mlr.press
Regret minimization in streaming multi-armed bandits (MABs) has been studied extensively,
and recent work has shown that algorithms with $ o (K) $ memory have to incur $\Omega …
and recent work has shown that algorithms with $ o (K) $ memory have to incur $\Omega …
The sample complexity of multi-distribution learning
B Peng - The Thirty Seventh Annual Conference on Learning …, 2024 - proceedings.mlr.press
Multi-distribution learning generalizes the classic PAC learning to handle data coming from
multiple distributions. Given a set of $ k $ data distributions and a hypothesis class of VC …
multiple distributions. Given a set of $ k $ data distributions and a hypothesis class of VC …
Exploiting heterogeneity in robust federated best-arm identification
We study a federated variant of the best-arm identification problem in stochastic multi-armed
bandits: a set of clients, each of whom can sample only a subset of the arms, collaborate via …
bandits: a set of clients, each of whom can sample only a subset of the arms, collaborate via …
Incentivized communication for federated bandits
Most existing works on federated bandits take it for granted that all clients are altruistic about
sharing their data with the server for the collective good whenever needed. Despite their …
sharing their data with the server for the collective good whenever needed. Despite their …
Optimal streaming algorithms for multi-armed bandits
This paper studies two variants of the best arm identification (BAI) problem under the
streaming model, where we have a stream of n arms with reward distributions supported on …
streaming model, where we have a stream of n arms with reward distributions supported on …
Distributed linear bandits under communication constraints
We consider distributed linear bandits where $ M $ agents learn collaboratively to minimize
the overall cumulative regret incurred by all agents. Information exchange is facilitated by a …
the overall cumulative regret incurred by all agents. Information exchange is facilitated by a …
Collaborative top distribution identifications with limited interaction
We consider the following problem in this paper: given a set of n distributions, find the top-m
ones with the largest means. This problem is also called top-m arm identifications in the …
ones with the largest means. This problem is also called top-m arm identifications in the …