A survey on causal inference
Causal inference has been a critical research topic for decades across many domains, including statistics,
computer science, education, public policy, and economics. Nowadays …
AI and personalization
This chapter reviews the recent developments at the intersection of personalization and AI in
marketing and related fields. We provide a formal definition of personalized policy and …
Introduction to multi-armed bandits
A Slivkins - Foundations and Trends® in Machine Learning, 2019 - nowpublishers.com
Multi-armed bandits are a simple but very powerful framework for algorithms that make
decisions over time under uncertainty. An enormous body of work has accumulated over the …
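The framework the snippet describes can be made concrete with a toy example. The following sketch (not drawn from the surveyed text; the function name, arm model, and parameters are illustrative assumptions) shows an epsilon-greedy policy on Bernoulli-reward arms, the simplest way to trade exploration against exploitation:

```python
import random

def epsilon_greedy(true_means, steps=10_000, epsilon=0.1, seed=0):
    """Toy epsilon-greedy bandit: with prob. epsilon pull a random arm
    (explore), otherwise pull the arm with the highest running estimate
    of its mean reward (exploit). Rewards are Bernoulli(true_means[arm])."""
    rng = random.Random(seed)
    k = len(true_means)
    counts = [0] * k          # pulls per arm
    estimates = [0.0] * k     # running mean reward per arm
    total_reward = 0.0
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(k)                           # explore
        else:
            arm = max(range(k), key=lambda a: estimates[a])  # exploit
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        # Incremental mean update: est += (r - est) / n
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward
    return estimates, total_reward
```

With enough pulls the estimates concentrate around the true means, so the greedy step increasingly favors the best arm while the epsilon fraction of random pulls keeps every arm's estimate from going stale.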
Beyond UCB: Optimal and efficient contextual bandits with regression oracles
A fundamental challenge in contextual bandits is to develop flexible, general-purpose
algorithms with computational requirements no worse than classical supervised learning …
Balanced linear contextual bandits
Contextual bandit algorithms are sensitive to the estimation method of the outcome model as
well as the exploration method used, particularly in the presence of rich heterogeneity or …
Estimation considerations in contextual bandits
Contextual bandit algorithms are sensitive to the estimation method of the outcome model as
well as the exploration method used, particularly in the presence of rich heterogeneity or …
Bypassing the monster: A faster and simpler optimal algorithm for contextual bandits under realizability
We consider the general (stochastic) contextual bandit problem under the realizability
assumption, that is, the expected reward, as a function of contexts and actions, belongs to a …
Mostly exploration-free algorithms for contextual bandits
The contextual bandit literature has traditionally focused on algorithms that address the
exploration–exploitation tradeoff. In particular, greedy algorithms that exploit current …
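The greedy algorithms the snippet mentions exploit current parameter estimates without any explicit exploration bonus. As an illustration (an assumption-laden sketch, not the paper's algorithm: the linear reward model, ridge estimator, noise level, and all names here are mine), a purely greedy linear contextual bandit can be written as:

```python
import numpy as np

def greedy_linear_bandit(theta_true, T=2000, d=3, lam=1.0, seed=0):
    """Exploration-free linear contextual bandit: each round, compute a
    ridge-regression estimate of each arm's parameter vector and pull the
    arm with the highest predicted reward for the observed context.
    Contexts are i.i.d. Gaussian; rewards are linear plus Gaussian noise."""
    rng = np.random.default_rng(seed)
    k = len(theta_true)
    A = [lam * np.eye(d) for _ in range(k)]   # regularized Gram matrix per arm
    b = [np.zeros(d) for _ in range(k)]       # response vector per arm
    total_reward = 0.0
    for _ in range(T):
        x = rng.normal(size=d)                # stochastic context
        theta_hat = [np.linalg.solve(A[a], b[a]) for a in range(k)]
        arm = int(np.argmax([x @ th for th in theta_hat]))  # pure exploitation
        r = x @ theta_true[arm] + 0.1 * rng.normal()        # noisy linear reward
        A[arm] += np.outer(x, x)
        b[arm] += r * x
        total_reward += r
    return total_reward
```

The intuition behind the "mostly exploration-free" line of work is that when contexts are sufficiently diverse, this kind of greedy update already gathers enough information about every arm that explicit exploration adds little.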
Instance-dependent complexity of contextual bandits and reinforcement learning: A disagreement-based perspective
In the classical multi-armed bandit problem, instance-dependent algorithms attain improved
performance on "easy" problems with a gap between the best and second-best arm. Are …
Contextual bandits with large action spaces: Made practical
A central problem in sequential decision making is to develop algorithms that are practical
and computationally efficient, yet support the use of flexible, general-purpose models …