Introduction to multi-armed bandits
A Slivkins - Foundations and Trends® in Machine Learning, 2019 - nowpublishers.com
Multi-armed bandits are a simple but very powerful framework for algorithms that make
decisions over time under uncertainty. An enormous body of work has accumulated over the …
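As a concrete illustration of the framework described in this entry (not an algorithm taken from the monograph itself), below is a minimal sketch of the classical UCB1 rule for a stochastic bandit: play each arm once, then repeatedly pick the arm with the largest empirical mean plus exploration bonus. The `pull` callback and the Bernoulli arm means in the usage lines are hypothetical.

```python
import math
import random

def ucb1(pull, n_arms, horizon):
    """Minimal UCB1 loop; pull(arm) is assumed to return a reward in [0, 1]."""
    counts = [0] * n_arms
    means = [0.0] * n_arms
    for t in range(1, horizon + 1):
        if t <= n_arms:
            arm = t - 1  # initialization: play each arm once
        else:
            # pick the arm maximizing empirical mean + sqrt(2 log t / n_a)
            arm = max(range(n_arms),
                      key=lambda a: means[a] + math.sqrt(2 * math.log(t) / counts[a]))
        r = pull(arm)
        counts[arm] += 1
        means[arm] += (r - means[arm]) / counts[arm]  # running average update
    return means

# Hypothetical usage: three Bernoulli arms with unknown success probabilities.
probs = [0.2, 0.5, 0.7]
estimates = ucb1(lambda a: float(random.random() < probs[a]), len(probs), 10_000)
```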
Federated multi-armed bandits
Federated multi-armed bandits (FMAB) is a new bandit paradigm that parallels the federated
learning (FL) framework in supervised learning. It is inspired by practical applications in …
Bandit learning in decentralized matching markets
We study two-sided matching markets in which one side of the market (the players) does not
have a priori knowledge about its preferences for the other side (the arms) and is required to …
SIC-MMAB: Synchronisation involves communication in multiplayer multi-armed bandits
Motivated by cognitive radio networks, we consider the stochastic multiplayer multi-armed
bandit problem, where several players pull arms simultaneously and collisions occur if one …
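For readers unfamiliar with the multiplayer setting mentioned in this and later entries, the sketch below shows the standard collision-reward convention (an assumed model, not code from the cited paper): when two or more players pull the same arm in a round, the colliding players receive reward 0, while a player alone on an arm draws a Bernoulli reward with that arm's mean. The arm means and player choices in the usage line are hypothetical.

```python
import random
from collections import Counter

def multiplayer_round(choices, probs):
    """One round under the standard collision model: colliding players get 0,
    a lone player on an arm gets a Bernoulli reward with that arm's mean."""
    counts = Counter(choices)
    rewards = []
    for arm in choices:
        if counts[arm] > 1:
            rewards.append(0.0)  # collision: the reward is lost
        else:
            rewards.append(float(random.random() < probs[arm]))
    return rewards

# Hypothetical usage: 3 players choosing among 4 arms (players 2 and 3 collide on arm 2).
print(multiplayer_round(choices=[0, 2, 2], probs=[0.9, 0.7, 0.5, 0.3]))
```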
Cooperative stochastic bandits with asynchronous agents and constrained feedback
This paper studies a cooperative multi-armed bandit problem with $M$ agents cooperating
together to solve the same instance of a $K$-armed stochastic bandit problem with the goal …
Heterogeneous multi-player multi-armed bandits: Closing the gap and generalization
Despite the significant interest and many recent advances in decentralized multi-player multi-
armed bandit (MP-MAB) problems, the regret gap to the natural centralized …
Regret, stability & fairness in matching markets with bandit learners
Making an informed decision—for example, when choosing a career or housing—requires
knowledge about the available options. Such knowledge is generally acquired through …
Cooperative multi-agent bandits with heavy tails
A Dubey - International conference on machine learning, 2020 - proceedings.mlr.press
We study the heavy-tailed stochastic bandit problem in the cooperative multi-agent setting,
where a group of agents interact with a common bandit problem, while communicating on a …
Multiplayer bandits without observing collision information
We study multiplayer stochastic multiarmed bandit problems in which the players cannot
communicate, and if two or more players pull the same arm, a collision occurs and the …
[BOOK] Multi-armed bandits: Theory and applications to online learning in networks
Q Zhao - 2019 - books.google.com
Multi-armed bandit problems pertain to optimal sequential decision making and learning in
unknown environments. Since the first bandit problem posed by Thompson in 1933 for the …