Bandit learning in decentralized matching markets
We study two-sided matching markets in which one side of the market (the players) does not
have a priori knowledge about its preferences for the other side (the arms) and is required to …
have a priori knowledge about its preferences for the other side (the arms) and is required to …
Efficient decentralized multi-agent learning in asymmetric queuing systems
We study decentralized multi-agent learning in bipartite queuing systems, a standard model
for service systems. In particular, N agents request service from K servers in a fully …
for service systems. In particular, N agents request service from K servers in a fully …
Heterogeneous multi-player multi-armed bandits: Closing the gap and generalization
Despite the significant interests and many progresses in decentralized multi-player multi-
armed bandits (MP-MAB) problems in recent years, the regret gap to the natural centralized …
armed bandits (MP-MAB) problems in recent years, the regret gap to the natural centralized …
Decentralized learning in online queuing systems
Motivated by packet routing in computer networks, online queuing systems are composed of
queues receiving packets at different rates. Repeatedly, they send packets to servers, each …
queues receiving packets at different rates. Repeatedly, they send packets to servers, each …
Decentralized cooperative reinforcement learning with hierarchical information structure
Multi-agent reinforcement learning (MARL) problems are challenging due to information
asymmetry. To overcome this challenge, existing methods often require high level of …
asymmetry. To overcome this challenge, existing methods often require high level of …
Multiplayer bandits without observing collision information
We study multiplayer stochastic multiarmed bandit problems in which the players cannot
communicate, and if two or more players pull the same arm, a collision occurs and the …
communicate, and if two or more players pull the same arm, a collision occurs and the …
Improved Bandits in Many-to-One Matching Markets with Incentive Compatibility
Two-sided matching markets have been widely studied in the literature due to their rich
applications. Since participants are usually uncertain about their preferences, online …
applications. Since participants are usually uncertain about their preferences, online …
Decentralized, communication-and coordination-free learning in structured matching markets
We study the problem of online learning in competitive settings in the context of two-sided
matching markets. In particular, one side of the market, the agents, must learn about their …
matching markets. In particular, one side of the market, the agents, must learn about their …
Multi-player multi-armed bandits with collision-dependent reward distributions
We study a new stochastic multi-player multi-armed bandits (MP-MAB) problem, where the
reward distribution changes if a collision occurs on the arm. Existing literature always …
reward distribution changes if a collision occurs on the arm. Existing literature always …
Towards optimal algorithms for multi-player bandits without collision sensing information
We propose a novel algorithm for multi-player multi-armed bandits without collision sensing
information. Our algorithm circumvents two problems shared by all state-of-the-art …
information. Our algorithm circumvents two problems shared by all state-of-the-art …