Dynamic pricing and learning: historical origins, current research, and new directions
AV Den Boer - Surveys in operations research and management …, 2015 - Elsevier
The topic of dynamic pricing and learning has received a considerable amount of attention
in recent years, from different scientific communities. We survey these literature streams: we …
[BOOK] Bandit algorithms
T Lattimore, C Szepesvári - 2020 - books.google.com
Decision-making in the face of uncertainty is a significant challenge in machine learning,
and the multi-armed bandit model is a commonly used framework to address it. This …
Introduction to multi-armed bandits
A Slivkins - Foundations and Trends® in Machine Learning, 2019 - nowpublishers.com
Multi-armed bandits are a simple but very powerful framework for algorithms that make
decisions over time under uncertainty. An enormous body of work has accumulated over the …
Online decision making with high-dimensional covariates
Big data have enabled decision makers to tailor decisions at the individual level in a variety
of domains, such as personalized medicine and online advertising. Doing so involves …
Beyond UCB: Optimal and efficient contextual bandits with regression oracles
A fundamental challenge in contextual bandits is to develop flexible, general-purpose
algorithms with computational requirements no worse than classical supervised learning …
Regret analysis of stochastic and nonstochastic multi-armed bandit problems
Multi-armed bandit problems are the most basic examples of sequential decision problems
with an exploration-exploitation trade-off. This is the balance between staying with the option …
[PDF] Counterfactual reasoning and learning systems: The example of computational advertising.
This work shows how to leverage causal inference to understand the behavior of complex
learning systems interacting with their environment and predict the consequences of …
Stochastic multi-armed-bandit problem with non-stationary rewards
In a multi-armed bandit (MAB) problem a gambler needs to choose at each round of play
one of K arms, each characterized by an unknown reward distribution. Reward realizations …
Balanced linear contextual bandits
Contextual bandit algorithms are sensitive to the estimation method of the outcome model as
well as the exploration method used, particularly in the presence of rich heterogeneity or …
Estimation considerations in contextual bandits
Contextual bandit algorithms are sensitive to the estimation method of the outcome model as
well as the exploration method used, particularly in the presence of rich heterogeneity or …