Google Scholar

Modeling recommender ecosystems: Research challenges at the intersection of mechanism design, reinforcement learning and generative models

C Boutilier, M Mladenov, G Tennenholtz - ar…

… insert-eliminate algorithm for multi-agent bandits

R Chawla, A Sankararaman… - International …, 2020 - proceedings.mlr.press

We consider a decentralized multi-agent Multi Armed Bandit (MAB) setup consisting of $ N $
agents, solving the same MAB instance to minimize individual cumulative regret. In our …

Cited by 56 · All 9 versions

[PDF] acm.org

Robust multi-agent multi-armed bandits

D Vial, S Shakkottai, R Srikant - … Design for Mobile Networks and Mobile …, 2021 - dl.acm.org

Recent works have shown that agents facing independent instances of a stochastic K-armed
bandit can collaborate to decrease regret. However, these works assume that each agent …

Cited by 47 · All 5 versions

[PDF] jmlr.org

Learning strategies in decentralized matching markets under uncertain preferences

X Dai, MI Jordan - Journal of Machine Learning Research, 2021 - jmlr.org

We study the problem of decision-making in the setting of a scarcity of shared resources
when the preferences of agents are unknown a priori and must be learned from data. Taking …

Cited by 33 · All 8 versions

Saved to my library

Competing bandits in matching markets

Modeling recommender ecosystems: Research challenges at the intersection of mechanism design, reinforcement learning and generative models

Robust multi-agent multi-armed bandits

Learning strategies in decentralized matching markets under uncertain preferences