محقق Google

CM Wu, E Schulz, M Speekenbrink, JD Nelson… - Nature human …, 2018‏ - nature.com‏

From foraging for food to learning complex games, many aspects of human behaviour can
be framed as a search problem with a vast space of possible actions. Under finite search …‏

ذخیره ارجاع بیان شده در 269 یافته مقاله‌های مربوط تمام نسخه‌های 17

[کتاب][B] Passivity-based control and estimation in networked robotics‏

T Hatanaka, N Chopra, M Fujita, MW Spong - 2015‏ - Springer‏

Passivity is an input–output property of dynamical systems. The concept generalizes
physical systems that cannot store more energy than the energy supplied from outside the …‏

ذخیره ارجاع بیان شده در 271 یافته مقاله‌های مربوط تمام نسخه‌های 7

[Free GPT-4]
[DeepSeek]

[PDF] nature.com

Time pressure changes how people explore and respond to uncertainty‏

CM Wu, E Schulz, TJ Pleskac, M Speekenbrink - Scientific reports, 2022‏ - nature.com‏

How does time pressure influence exploration and decision-making? We investigated this
question with several four-armed bandit tasks manipulating (within subjects) expected …‏

ذخیره ارجاع بیان شده در 71 یافته مقاله‌های مربوط تمام نسخه‌های 20

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Fast mmwave beam alignment via correlated bandit learning‏

W Wu, N Cheng, N Zhang, P Yang… - IEEE Transactions …, 2019‏ - ieeexplore.ieee.org‏

Beam alignment (BA) is to ensure the transmitter and receiver beams are accurately aligned
to establish a reliable communication link in millimeter-wave (mmwave) systems. Existing …‏

ذخیره ارجاع بیان شده در 135 یافته مقاله‌های مربوط تمام نسخه‌های 4

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A survey of online experiment design with the stochastic multi-armed bandit‏

G Burtini, J Loeppky, R Lawrence - arxiv preprint arxiv:1510.00757, 2015‏ - arxiv.org‏

Adaptive and sequential experiment design is a well-studied area in numerous domains. We
survey and synthesize the work of the online statistical learning paradigm referred to as multi …‏

ذخیره ارجاع بیان شده در 168 یافته مقاله‌های مربوط تمام نسخه‌های 3 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] nih.gov

Understanding doctor decision making: The case of depression treatment‏

JM Currie, WB MacLeod - Econometrica, 2020‏ - Wiley Online Library‏

Treatment for depression is complex, requiring decisions that may involve trade‐offs
between exploiting treatments with the highest expected value and experimenting with …‏

ذخیره ارجاع بیان شده در 100 یافته مقاله‌های مربوط تمام نسخه‌های 12

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Distributed cooperative decision-making in multiarmed bandits: Frequentist and bayesian algorithms‏

P Landgren, V Srivastava… - 2016 IEEE 55th …, 2016‏ - ieeexplore.ieee.org‏

We study distributed cooperative decision-making under the explore-exploit tradeoff in the
multiarmed bandit (MAB) problem. We extend state-of-the-art frequentist and Bayesian …‏

ذخیره ارجاع بیان شده در 134 یافته مقاله‌های مربوط تمام نسخه‌های 11

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Modeling, replicating, and predicting human behavior: A survey‏

A Fuchs, A Passarella, M Conti - ACM Transactions on Autonomous and …, 2023‏ - dl.acm.org‏

Given the popular presupposition of human reasoning as the standard for learning and
decision making, there have been significant efforts and a growing trend in research to …‏

ذخیره ارجاع بیان شده در 23 یافته مقاله‌های مربوط تمام نسخه‌های 4

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Online joint bid/daily budget optimization of internet advertising campaigns‏

A Nuara, F Trovò, N Gatti, M Restelli - Artificial Intelligence, 2022‏ - Elsevier‏

Pay-per-click advertising includes various formats (eg, search, contextual, social) with a total
investment of more than 200 billion USD per year worldwide. An advertiser is given a daily …‏

ذخیره ارجاع بیان شده در 52 یافته مقاله‌های مربوط تمام نسخه‌های 7

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

On distributed cooperative decision-making in multiarmed bandits‏

P Landgren, V Srivastava… - 2016 European Control …, 2016‏ - ieeexplore.ieee.org‏

We study the explore-exploit tradeoff in distributed cooperative decision-making using the
context of the multiarmed bandit (MAB) problem. For the distributed cooperative MAB …‏

ذخیره ارجاع بیان شده در 91 یافته مقاله‌های مربوط تمام نسخه‌های 9

ایجاد هشدار

ارجاع

جستجوی پیشرفته

در «کتابخانه من» ذخیره شد

Modeling human decision making in generalized Gaussian multiarmed bandits

Generalization guides human exploration in vast decision spaces‏

[کتاب][B] Passivity-based control and estimation in networked robotics‏

Time pressure changes how people explore and respond to uncertainty‏

Fast mmwave beam alignment via correlated bandit learning‏

A survey of online experiment design with the stochastic multi-armed bandit‏

Understanding doctor decision making: The case of depression treatment‏

Distributed cooperative decision-making in multiarmed bandits: Frequentist and bayesian algorithms‏

Modeling, replicating, and predicting human behavior: A survey‏

Online joint bid/daily budget optimization of internet advertising campaigns‏

On distributed cooperative decision-making in multiarmed bandits‏