- Academic Search

Cooperative and stochastic multi-player multi-armed bandit: Optimal regret with neither communica...

LT Liu, F Ruan, H Mania, MI Jordan - Journal of Machine Learning …, 2021 - jmlr.org

We study two-sided matching markets in which one side of the market (the players) does not
have a priori knowledge about its preferences for the other side (the arms) and is required to …

Save Cite Cited by 84 Related articles All 7 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] mlr.press

Efficient decentralized multi-agent learning in asymmetric queuing systems

D Freund, T Lykouris, W Weng - Conference on Learning …, 2022 - proceedings.mlr.press

We study decentralized multi-agent learning in bipartite queuing systems, a standard model
for service systems. In particular, N agents request service from K servers in a fully …

Save Cite Cited by 18 Related articles View as HTML

[Free GPT-4]

[PDF] neurips.cc

Heterogeneous multi-player multi-armed bandits: Closing the gap and generalization

C Shi, W **ong, C Shen, J Yang - Advances in neural …, 2021 - proceedings.neurips.cc

Despite the significant interests and many progresses in decentralized multi-player multi-
armed bandits (MP-MAB) problems in recent years, the regret gap to the natural centralized …

Save Cite Cited by 25 Related articles All 9 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

Decentralized learning in online queuing systems

F Sentenac, E Boursier… - Advances in Neural …, 2021 - proceedings.neurips.cc

Motivated by packet routing in computer networks, online queuing systems are composed of
queues receiving packets at different rates. Repeatedly, they send packets to servers, each …

Save Cite Cited by 28 Related articles All 10 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] mlr.press

Decentralized cooperative reinforcement learning with hierarchical information structure

H Kao, CY Wei, V Subramanian - … Conference on Algorithmic …, 2022 - proceedings.mlr.press

Multi-agent reinforcement learning (MARL) problems are challenging due to information
asymmetry. To overcome this challenge, existing methods often require high level of …

Save Cite Cited by 19 Related articles All 4 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Multiplayer bandits without observing collision information

G Lugosi, A Mehrabian - Mathematics of Operations …, 2022 - pubsonline.informs.org

We study multiplayer stochastic multiarmed bandit problems in which the players cannot
communicate, and if two or more players pull the same arm, a collision occurs and the …

Save Cite Cited by 55 Related articles All 6 versions Free GPT-4

[Free GPT-4]

[PDF] aaai.org

Improved Bandits in Many-to-One Matching Markets with Incentive Compatibility

F Kong, S Li - Proceedings of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org

Two-sided matching markets have been widely studied in the literature due to their rich
applications. Since participants are usually uncertain about their preferences, online …

Save Cite Cited by 4 Related articles All 3 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

Decentralized, communication-and coordination-free learning in structured matching markets

C Maheshwari, S Sastry… - Advances in Neural …, 2022 - proceedings.neurips.cc

We study the problem of online learning in competitive settings in the context of two-sided
matching markets. In particular, one side of the market, the agents, must learn about their …

Save Cite Cited by 16 Related articles All 13 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] ieee.org

Multi-player multi-armed bandits with collision-dependent reward distributions

C Shi, C Shen - IEEE Transactions on Signal Processing, 2021 - ieeexplore.ieee.org

We study a new stochastic multi-player multi-armed bandits (MP-MAB) problem, where the
reward distribution changes if a collision occurs on the arm. Existing literature always …

Save Cite Cited by 15 Related articles All 6 versions Free GPT-4

[Free GPT-4]

[PDF] mlr.press

Towards optimal algorithms for multi-player bandits without collision sensing information

W Huang, R Combes, C Trinh - Conference on Learning …, 2022 - proceedings.mlr.press

We propose a novel algorithm for multi-player multi-armed bandits without collision sensing
information. Our algorithm circumvents two problems shared by all state-of-the-art …

Save Cite Cited by 14 Related articles All 6 versions Free GPT-4 View as HTML

Create alert

Cite

Advanced search

Saved to My library

Cooperative and stochastic multi-player multi-armed bandit: Optimal regret with neither communica...

Bandit learning in decentralized matching markets

Efficient decentralized multi-agent learning in asymmetric queuing systems

Heterogeneous multi-player multi-armed bandits: Closing the gap and generalization

Decentralized learning in online queuing systems

Decentralized cooperative reinforcement learning with hierarchical information structure

Multiplayer bandits without observing collision information

Improved Bandits in Many-to-One Matching Markets with Incentive Compatibility

Decentralized, communication-and coordination-free learning in structured matching markets

Multi-player multi-armed bandits with collision-dependent reward distributions

Towards optimal algorithms for multi-player bandits without collision sensing information