- Academic Search

Y Ruan, J Yang, Y Zhou - Proceedings of the 53rd Annual ACM SIGACT …, 2021 - dl.acm.org

Motivated by practical needs such as large-scale learning, we study the impact of adaptivity
constraints to linear contextual bandits, a central problem in online learning and decision …

Enregistrer Citer Cité 63 fois Autres articles Les 6 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Near-optimal collaborative learning in bandits

C Réda, S Vakili, E Kaufmann - Advances in Neural …, 2022 - proceedings.neurips.cc

This paper introduces a general multi-agent bandit model in which each agent is facing a
finite set of arms and may communicate with other agents through a central controller in …

Enregistrer Citer Cité 24 fois Autres articles Les 11 versions Free GPT-4 DeepSeek Version HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Communication efficient distributed learning for kernelized contextual bandits

C Li, H Wang, M Wang, H Wang - Advances in Neural …, 2022 - proceedings.neurips.cc

We tackle the communication efficiency challenge of learning kernelized contextual bandits
in a distributed setting. Despite the recent advances in communication-efficient distributed …

Enregistrer Citer Cité 18 fois Autres articles Les 7 versions Free GPT-4 DeepSeek Version HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Tight regret bounds for single-pass streaming multi-armed bandits

C Wang - International Conference on Machine Learning, 2023 - proceedings.mlr.press

Regret minimization in streaming multi-armed bandits (MABs) has been studied extensively,
and recent work has shown that algorithms with $ o (K) $ memory have to incur $\Omega …

Enregistrer Citer Cité 8 fois Autres articles Les 6 versions Free GPT-4 DeepSeek Version HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

The sample complexity of multi-distribution learning

B Peng - The Thirty Seventh Annual Conference on Learning …, 2024 - proceedings.mlr.press

Multi-distribution learning generalizes the classic PAC learning to handle data coming from
multiple distributions. Given a set of $ k $ data distributions and a hypothesis class of VC …

Enregistrer Citer Cité 8 fois Autres articles Les 2 versions Free GPT-4 DeepSeek Version HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Exploiting heterogeneity in robust federated best-arm identification

A Mitra, H Hassani, G Pappas - arxiv preprint arxiv:2109.05700, 2021 - arxiv.org

We study a federated variant of the best-arm identification problem in stochastic multi-armed
bandits: a set of clients, each of whom can sample only a subset of the arms, collaborate via …

Enregistrer Citer Cité 31 fois Autres articles Les 3 versions Free GPT-4 DeepSeek Version HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Incentivized communication for federated bandits

Z Wei, C Li, H Xu, H Wang - Advances in Neural …, 2023 - proceedings.neurips.cc

Most existing works on federated bandits take it for granted that all clients are altruistic about
sharing their data with the server for the collective good whenever needed. Despite their …

Enregistrer Citer Cité 3 fois Autres articles Les 5 versions Free GPT-4 DeepSeek Version HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Optimal streaming algorithms for multi-armed bandits

T **, K Huang, J Tang, X **ao - International Conference on …, 2021 - proceedings.mlr.press

This paper studies two variants of the best arm identification (BAI) problem under the
streaming model, where we have a stream of n arms with reward distributions supported on …

Enregistrer Citer Cité 22 fois Autres articles Les 4 versions Free GPT-4 DeepSeek Version HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Distributed linear bandits under communication constraints

S Salgia, Q Zhao - International Conference on Machine …, 2023 - proceedings.mlr.press

We consider distributed linear bandits where $ M $ agents learn collaboratively to minimize
the overall cumulative regret incurred by all agents. Information exchange is facilitated by a …

Enregistrer Citer Cité 9 fois Autres articles Les 6 versions Free GPT-4 DeepSeek Version HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Collaborative top distribution identifications with limited interaction

N Karpov, Q Zhang, Y Zhou - 2020 IEEE 61st Annual …, 2020 - ieeexplore.ieee.org

We consider the following problem in this paper: given a set of n distributions, find the top-m
ones with the largest means. This problem is also called top-m arm identifications in the …

Enregistrer Citer Cité 28 fois Autres articles Les 8 versions Free GPT-4 DeepSeek

Créer l'alerte

Citer

Recherche avancée

Enregistré dans Ma bibliothèque

Collaborative learning with limited interaction: Tight bounds for distributed exploration...

Linear bandits with limited adaptivity and learning distributional optimal design

Near-optimal collaborative learning in bandits

Communication efficient distributed learning for kernelized contextual bandits

Tight regret bounds for single-pass streaming multi-armed bandits

The sample complexity of multi-distribution learning

Exploiting heterogeneity in robust federated best-arm identification

Incentivized communication for federated bandits

Optimal streaming algorithms for multi-armed bandits

Distributed linear bandits under communication constraints

Collaborative top distribution identifications with limited interaction