Federated linear contextual bandits
This paper presents a novel federated linear contextual bandits model, where individual
clients face different $K$-armed stochastic bandits coupled through common global …
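As a point of reference for the model (an illustrative sketch, not the paper's federated algorithm), the single-client building block is the standard linear contextual bandit. A minimal LinUCB-style loop might look like the following; the dimension `d`, exploration weight `alpha`, and the callback names `contexts` and `rewards_fn` are arbitrary choices for illustration.

```python
import numpy as np

# Minimal single-client LinUCB sketch (illustrative only; not the federated
# algorithm from the paper). Assumes K arms, d-dimensional contexts, and an
# arbitrary exploration weight alpha.
def linucb(contexts, rewards_fn, K, d, T, alpha=1.0):
    A = [np.eye(d) for _ in range(K)]    # per-arm ridge-regression Gram matrices
    b = [np.zeros(d) for _ in range(K)]  # per-arm response vectors
    total_reward = 0.0
    for t in range(T):
        x = contexts(t)                  # context vector observed at round t
        ucb = np.zeros(K)
        for a in range(K):
            A_inv = np.linalg.inv(A[a])
            theta = A_inv @ b[a]         # least-squares estimate for arm a
            ucb[a] = theta @ x + alpha * np.sqrt(x @ A_inv @ x)
        arm = int(np.argmax(ucb))
        r = rewards_fn(arm, x)           # observe a stochastic reward
        A[arm] += np.outer(x, x)         # update only the pulled arm's statistics
        b[arm] += r * x
        total_reward += r
    return total_reward
```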
Efficient and targeted COVID-19 border testing via reinforcement learning
Throughout the coronavirus disease 2019 (COVID-19) pandemic, countries have relied on a
variety of ad hoc border control protocols to allow for non-essential travel while safeguarding …
Customer acquisition via display advertising using multi-armed bandit experiments
Firms using online advertising regularly run experiments with multiple versions of their ads
since they are uncertain about which ones are most effective. During a campaign, firms try to …
Batched multi-armed bandits problem
In this paper, we study the multi-armed bandit problem in the batched setting where the
employed policy must split data into a small number of batches. While the minimax regret for …
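To make the batch constraint concrete (an illustrative sketch, not the minimax-optimal policy studied in the paper), one can run successive elimination in which arm statistics are frozen within a batch and eliminations happen only at the M batch boundaries; the Hoeffding-style constants and equal per-batch allocation below are arbitrary.

```python
import numpy as np

# Illustrative batched successive-elimination sketch. All surviving arms are
# pulled equally often within a batch, and arms are eliminated only at the
# M batch boundaries, so the policy adapts at most M times.
def batched_elimination(pull, K, T, M=4):
    active = list(range(K))
    means = np.zeros(K)
    counts = np.zeros(K)
    for _ in range(M):
        pulls_each = max((T // M) // len(active), 1)
        for a in active:
            rewards = [pull(a) for _ in range(pulls_each)]
            means[a] = (means[a] * counts[a] + sum(rewards)) / (counts[a] + pulls_each)
            counts[a] += pulls_each
        # confidence radius from a Hoeffding bound (illustrative constants)
        rad = {a: np.sqrt(2.0 * np.log(max(T, 2)) / counts[a]) for a in active}
        best_lcb = max(means[a] - rad[a] for a in active)
        active = [a for a in active if means[a] + rad[a] >= best_lcb]
    return active, means, counts
```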
Bypassing the monster: A faster and simpler optimal algorithm for contextual bandits under realizability
We consider the general (stochastic) contextual bandit problem under the realizability
assumption, that is, the expected reward, as a function of contexts and actions, belongs to a …
Inference for batched bandits
As bandit algorithms are increasingly utilized in scientific studies and industrial applications,
there is an associated increasing need for reliable inference methods based on the resulting …
Provably efficient q-learning with low switching cost
We take initial steps in studying PAC-MDP algorithms with limited adaptivity, that is,
algorithms that change their exploration policy as infrequently as possible during regret …
Decentralized cooperative stochastic bandits
We study a decentralized cooperative stochastic multi-armed bandit problem with K arms on
a network of N agents. In our model, the reward distribution of each arm is the same for each …
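One simple way to picture the cooperation (a sketch under my own assumptions, not the paper's algorithm) is each agent running UCB locally and averaging its running statistics with its neighbours after every round via an assumed doubly stochastic mixing matrix `W`; since the arm distributions are the same for every agent, averaging the sufficient statistics is a natural form of cooperation.

```python
import numpy as np

# Illustrative gossip-style cooperative UCB sketch: N agents, K arms.
# pull(i, arm) is an assumed callback returning agent i's reward; W is an
# assumed N x N doubly stochastic matrix encoding the communication network.
def cooperative_ucb(pull, K, N, T, W):
    sums = np.zeros((N, K))    # per-agent estimated cumulative rewards
    counts = np.ones((N, K))   # per-agent estimated pull counts (initialised to 1)
    for t in range(1, T + 1):
        for i in range(N):
            ucb = sums[i] / counts[i] + np.sqrt(2 * np.log(t + 1) / counts[i])
            arm = int(np.argmax(ucb))
            r = pull(i, arm)
            sums[i, arm] += r
            counts[i, arm] += 1
        sums = W @ sums        # one gossip/consensus step with the neighbours
        counts = W @ counts
    return sums / counts       # each agent's final mean estimates
```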
Bandits with delayed, aggregated anonymous feedback
We study a variant of the stochastic $K$-armed bandit problem, which we call "bandits with
delayed, aggregated anonymous feedback". In this problem, when the player pulls an arm, a …
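To make the feedback model concrete (my reading of the setup, sketched under assumed Bernoulli rewards and a bounded random delay; not code from the paper), an environment where each round reveals only the anonymous sum of whatever rewards happen to arrive could look like this:

```python
import numpy as np

# Illustrative environment for delayed, aggregated, anonymous feedback:
# each pull's reward arrives after a random delay, and the player only ever
# observes the SUM of rewards arriving in the current round, without knowing
# which past pulls generated them.
class DelayedAggregatedAnonymousEnv:
    def __init__(self, arm_means, max_delay=5, seed=0):
        self.rng = np.random.default_rng(seed)
        self.arm_means = np.asarray(arm_means, dtype=float)
        self.max_delay = max_delay
        self.pending = {}   # arrival round -> accumulated reward sum
        self.t = 0

    def pull(self, arm):
        # reward is generated now but only observed after a random delay
        reward = float(self.rng.random() < self.arm_means[arm])  # Bernoulli reward
        arrive = self.t + int(self.rng.integers(0, self.max_delay + 1))
        self.pending[arrive] = self.pending.get(arrive, 0.0) + reward
        observed = self.pending.pop(self.t, 0.0)                 # anonymous aggregate
        self.t += 1
        return observed
```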
SIC-MMAB: Synchronisation involves communication in multiplayer multi-armed bandits
Motivated by cognitive radio networks, we consider the stochastic multiplayer multi-armed
bandit problem, where several players pull arms simultaneously and collisions occur if one …
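The collision mechanism itself is easy to state in code (an illustrative sketch of the standard collision model, not the SIC-MMAB protocol): players who choose the same arm in a round collide and receive zero reward, while the others draw from their arm's distribution.

```python
import numpy as np

# Illustrative multiplayer collision model with Bernoulli rewards.
# choices[i] is the arm pulled by player i in this round; colliding players
# (two or more on the same arm) receive zero reward.
def multiplayer_round(arm_means, choices, rng):
    choices = np.asarray(choices)
    rewards = np.zeros(len(choices))
    for i, arm in enumerate(choices):
        collided = np.sum(choices == arm) > 1
        if not collided:
            rewards[i] = float(rng.random() < arm_means[arm])
    return rewards

# Example: 3 players, 4 arms; players 0 and 1 collide on arm 2 and get 0.
rng = np.random.default_rng(0)
print(multiplayer_round([0.1, 0.5, 0.7, 0.9], [2, 2, 3], rng))
```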