- Academic Search

AV Den Boer - Surveys in operations research and management …, 2015 - Elsevier

The topic of dynamic pricing and learning has received a considerable amount of attention
in recent years, from different scientific communities. We survey these literature streams: we …

Cited by 628. All 13 versions.

[PDF] tor-lattimore.com

[BOOK][B] Bandit algorithms

T Lattimore, C Szepesvári - 2020 - books.google.com

Decision-making in the face of uncertainty is a significant challenge in machine learning,
and the multi-armed bandit model is a commonly used framework to address it. This …

Cited by 3281. All 9 versions.

[PDF] ssrn.com

Online decision making with high-dimensional covariates

H Bastani, M Bayati - Operations Research, 2020 - pubsonline.informs.org

Big data have enabled decision makers to tailor decisions at the individual level in a variety
of domains, such as personalized medicine and online advertising. Doing so involves …

Cited by 623. All 12 versions.

[PDF] nowpublishers.com

Regret analysis of stochastic and nonstochastic multi-armed bandit problems

S Bubeck, N Cesa-Bianchi - Foundations and Trends® in …, 2012 - nowpublishers.com

Multi-armed bandit problems are the most basic examples of sequential decision problems
with an exploration-exploitation trade-off. This is the balance between staying with the option …


[PDF] aaai.org

Balanced linear contextual bandits

M Dimakopoulou, Z Zhou, S Athey… - Proceedings of the AAAI …, 2019 - ojs.aaai.org

Contextual bandit algorithms are sensitive to the estimation method of the outcome model as
well as the exploration method used, particularly in the presence of rich heterogeneity or …

Cited by 218. All 12 versions.

[PDF] arxiv.org

Estimation considerations in contextual bandits

M Dimakopoulou, Z Zhou, S Athey… - arXiv preprint arXiv …, 2017 - arxiv.org

Contextual bandit algorithms are sensitive to the estimation method of the outcome model as
well as the exploration method used, particularly in the presence of rich heterogeneity or …

Cited by 243. All 6 versions.

[PDF] ambujtewari.com

From ads to interventions: Contextual bandits in mobile health

A Tewari, SA Murphy - Mobile health: sensors, analytic methods, and …, 2017 - Springer

The first paper on contextual bandits was written by Michael Woodroofe in 1979 (Journal of
the American Statistical Association, 74 (368), 799–806, 1979) but the term “contextual …

Cited by 249. All 5 versions.

[PDF] projecteuclid.org

Batched bandit problems

V Perchet, P Rigollet, S Chassang, E Snowberg - 2016 - projecteuclid.org

The Annals of Statistics 2016, Vol. 44, No. 2, 660–681. DOI: 10.1214/15-AOS1381.
© Institute of Mathematical Statistics, 2016.

Cited by 278. All 26 versions.

[PDF] neurips.cc

Batched multi-armed bandits problem

Z Gao, Y Han, Z Ren, Z Zhou - Advances in Neural …, 2019 - proceedings.neurips.cc

In this paper, we study the multi-armed bandit problem in the batched setting where the
employed policy must split data into a small number of batches. While the minimax regret for …

Cited by 172. All 15 versions.

[PDF] arxiv.org

Instance-dependent complexity of contextual bandits and reinforcement learning: A disagreement-based perspective

DJ Foster, A Rakhlin, D Simchi-Levi, Y Xu - arXiv preprint arXiv …, 2020 - arxiv.org

In the classical multi-armed bandit problem, instance-dependent algorithms attain improved
performance on "easy" problems with a gap between the best and second-best arm. Are …

Cited by 101. All 4 versions.


The multi-armed bandit problem with covariates

Dynamic pricing and learning: historical origins, current research, and new directions
