Introduction to multi-armed bandits

A Slivkins - Foundations and Trends® in Machine Learning, 2019 - nowpublishers.com
Multi-armed bandits a simple but very powerful framework for algorithms that make
decisions over time under uncertainty. An enormous body of work has accumulated over the …

Multi-armed bandits in recommendation systems: A survey of the state-of-the-art and future directions

N Silva, H Werneck, T Silva, ACM Pereira… - Expert Systems with …, 2022 - Elsevier
Abstract Recommender Systems (RSs) have assumed a crucial role in several digital
companies by directly affecting their key performance indicators. Nowadays, in this era of big …

Bayesian exploration: Incentivizing exploration in Bayesian games

Y Mansour, A Slivkins, V Syrgkanis… - Operations …, 2022 - pubsonline.informs.org
We consider a ubiquitous scenario in the internet economy when individual decision makers
(henceforth, agents) both produce and consume information as they make strategic choices …

Bayesian incentive-compatible bandit exploration

Y Mansour, A Slivkins, V Syrgkanis - Proceedings of the Sixteenth ACM …, 2015 - dl.acm.org
Individual decision-makers consume information revealed by the previous decision makers,
and produce information that may help in future decision makers. This phenomenon is …

Ethical aspects of multi-stakeholder recommendation systems

S Milano, M Taddeo, L Floridi - The information society, 2021 - Taylor & Francis
In this article we analyze the ethical aspects of multistakeholder recommendation systems
(RSs). Following the most common approach in the literature, we assume a consequentialist …

Bayesian incentive-compatible bandit exploration

Y Mansour, A Slivkins, V Syrgkanis - Operations Research, 2020 - pubsonline.informs.org
As self-interested individuals (“agents”) make decisions over time, they utilize information
revealed by other agents in the past and produce information that may help agents in the …

Bayesian exploration: Incentivizing exploration in bayesian games

Y Mansour, A Slivkins, V Syrgkanis, ZS Wu - arxiv preprint arxiv …, 2016 - arxiv.org
We consider a ubiquitous scenario in the Internet economy when individual decision-makers
(henceforth, agents) both produce and consume information as they make strategic choices …

Incentivizing exploration with selective data disclosure

N Immorlica, J Mao, A Slivkins, ZS Wu - arxiv preprint arxiv:1811.06026, 2018 - arxiv.org
We propose and design recommendation systems that incentivize efficient exploration.
Agents arrive sequentially, choose actions and receive rewards, drawn from fixed but …

Strategic apple tasting

K Harris, C Podimata, SZ Wu - Advances in Neural …, 2023 - proceedings.neurips.cc
Algorithmic decision-making in high-stakes domains often involves assigning decisions to
agents with incentives to strategically modify their input to the algorithm. In addition to …

Engineering social learning: Information design of time-locked sales campaigns for online platforms

C Küçükgül, Ö Özer, S Wang - Management Science, 2022 - pubsonline.informs.org
Many online platforms offer time-locked sales campaigns, whereby products are sold at fixed
prices for prespecified lengths of time. Platforms often display some information about …