Generalization guides human exploration in vast decision spaces

CM Wu, E Schulz, M Speekenbrink, JD Nelson… - Nature human …, 2018‏ - nature.com
From foraging for food to learning complex games, many aspects of human behaviour can
be framed as a search problem with a vast space of possible actions. Under finite search …

[کتاب][B] Passivity-based control and estimation in networked robotics

T Hatanaka, N Chopra, M Fujita, MW Spong - 2015‏ - Springer
Passivity is an input–output property of dynamical systems. The concept generalizes
physical systems that cannot store more energy than the energy supplied from outside the …

Time pressure changes how people explore and respond to uncertainty

CM Wu, E Schulz, TJ Pleskac, M Speekenbrink - Scientific reports, 2022‏ - nature.com
How does time pressure influence exploration and decision-making? We investigated this
question with several four-armed bandit tasks manipulating (within subjects) expected …

Fast mmwave beam alignment via correlated bandit learning

W Wu, N Cheng, N Zhang, P Yang… - IEEE Transactions …, 2019‏ - ieeexplore.ieee.org
Beam alignment (BA) is to ensure the transmitter and receiver beams are accurately aligned
to establish a reliable communication link in millimeter-wave (mmwave) systems. Existing …

A survey of online experiment design with the stochastic multi-armed bandit

G Burtini, J Loeppky, R Lawrence - arxiv preprint arxiv:1510.00757, 2015‏ - arxiv.org
Adaptive and sequential experiment design is a well-studied area in numerous domains. We
survey and synthesize the work of the online statistical learning paradigm referred to as multi …

Understanding doctor decision making: The case of depression treatment

JM Currie, WB MacLeod - Econometrica, 2020‏ - Wiley Online Library
Treatment for depression is complex, requiring decisions that may involve trade‐offs
between exploiting treatments with the highest expected value and experimenting with …

Distributed cooperative decision-making in multiarmed bandits: Frequentist and bayesian algorithms

P Landgren, V Srivastava… - 2016 IEEE 55th …, 2016‏ - ieeexplore.ieee.org
We study distributed cooperative decision-making under the explore-exploit tradeoff in the
multiarmed bandit (MAB) problem. We extend state-of-the-art frequentist and Bayesian …

Modeling, replicating, and predicting human behavior: A survey

A Fuchs, A Passarella, M Conti - ACM Transactions on Autonomous and …, 2023‏ - dl.acm.org
Given the popular presupposition of human reasoning as the standard for learning and
decision making, there have been significant efforts and a growing trend in research to …

Online joint bid/daily budget optimization of internet advertising campaigns

A Nuara, F Trovò, N Gatti, M Restelli - Artificial Intelligence, 2022‏ - Elsevier
Pay-per-click advertising includes various formats (eg, search, contextual, social) with a total
investment of more than 200 billion USD per year worldwide. An advertiser is given a daily …

On distributed cooperative decision-making in multiarmed bandits

P Landgren, V Srivastava… - 2016 European Control …, 2016‏ - ieeexplore.ieee.org
We study the explore-exploit tradeoff in distributed cooperative decision-making using the
context of the multiarmed bandit (MAB) problem. For the distributed cooperative MAB …