Bandit learning in decentralized matching markets

LT Liu, F Ruan, H Mania, MI Jordan - Journal of Machine Learning …, 2021 - jmlr.org
We study two-sided matching markets in which one side of the market (the players) does not
have a priori knowledge about its preferences for the other side (the arms) and is required to …

Efficient decentralized multi-agent learning in asymmetric queuing systems

D Freund, T Lykouris, W Weng - Conference on Learning …, 2022 - proceedings.mlr.press
We study decentralized multi-agent learning in bipartite queuing systems, a standard model
for service systems. In particular, N agents request service from K servers in a fully …

Heterogeneous multi-player multi-armed bandits: Closing the gap and generalization

C Shi, W **ong, C Shen, J Yang - Advances in neural …, 2021 - proceedings.neurips.cc
Despite the significant interests and many progresses in decentralized multi-player multi-
armed bandits (MP-MAB) problems in recent years, the regret gap to the natural centralized …

Decentralized learning in online queuing systems

F Sentenac, E Boursier… - Advances in Neural …, 2021 - proceedings.neurips.cc
Motivated by packet routing in computer networks, online queuing systems are composed of
queues receiving packets at different rates. Repeatedly, they send packets to servers, each …

Decentralized cooperative reinforcement learning with hierarchical information structure

H Kao, CY Wei, V Subramanian - … Conference on Algorithmic …, 2022 - proceedings.mlr.press
Multi-agent reinforcement learning (MARL) problems are challenging due to information
asymmetry. To overcome this challenge, existing methods often require high level of …

Multiplayer bandits without observing collision information

G Lugosi, A Mehrabian - Mathematics of Operations …, 2022 - pubsonline.informs.org
We study multiplayer stochastic multiarmed bandit problems in which the players cannot
communicate, and if two or more players pull the same arm, a collision occurs and the …

Improved Bandits in Many-to-One Matching Markets with Incentive Compatibility

F Kong, S Li - Proceedings of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
Two-sided matching markets have been widely studied in the literature due to their rich
applications. Since participants are usually uncertain about their preferences, online …

Decentralized, communication-and coordination-free learning in structured matching markets

C Maheshwari, S Sastry… - Advances in Neural …, 2022 - proceedings.neurips.cc
We study the problem of online learning in competitive settings in the context of two-sided
matching markets. In particular, one side of the market, the agents, must learn about their …

Multi-player multi-armed bandits with collision-dependent reward distributions

C Shi, C Shen - IEEE Transactions on Signal Processing, 2021 - ieeexplore.ieee.org
We study a new stochastic multi-player multi-armed bandits (MP-MAB) problem, where the
reward distribution changes if a collision occurs on the arm. Existing literature always …

Towards optimal algorithms for multi-player bandits without collision sensing information

W Huang, R Combes, C Trinh - Conference on Learning …, 2022 - proceedings.mlr.press
We propose a novel algorithm for multi-player multi-armed bandits without collision sensing
information. Our algorithm circumvents two problems shared by all state-of-the-art …