- Academic Search

K Jamieson, R Nowak - 2014 48th annual conference on …, 2014 - ieeexplore.ieee.org

This paper is concerned with identifying the arm with the highest mean in a multi-armed
bandit problem using as few independent samples from the arms as possible. While the so …

Lagre Referanse Sitert av 252 Beslektede artikler Alle 5 versjoner

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Non-stochastic best arm identification and hyperparameter optimization

K Jamieson, A Talwalkar - Artificial intelligence and statistics, 2016 - proceedings.mlr.press

Motivated by the task of hyperparameter optimization, we introduce the\em non-stochastic
best-arm identification problem. We identify an attractive algorithm for this setting that makes …

Lagre Referanse Sitert av 786 Beslektede artikler Alle 8 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Optimal best arm identification with fixed confidence

A Garivier, E Kaufmann - Conference on Learning Theory, 2016 - proceedings.mlr.press

We give a complete characterization of the complexity of best-arm identification in one-
parameter bandit problems. We prove a new, tight lower bound on the sample complexity …

Lagre Referanse Sitert av 437 Beslektede artikler Alle 15 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] jmlr.org

[PDF][PDF] On the complexity of best-arm identification in multi-armed bandit models

E Kaufmann, O Cappé, A Garivier - The Journal of Machine Learning …, 2016 - jmlr.org

The stochastic multi-armed bandit model is a simple abstraction that has proven useful in
many different contexts in statistics and machine learning. Whereas the achievable limit in …

Lagre Referanse Sitert av 661 Beslektede artikler Alle 13 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Almost optimal exploration in multi-armed bandits

Z Karnin, T Koren, O Somekh - International conference on …, 2013 - proceedings.mlr.press

We study the problem of exploration in stochastic Multi-Armed Bandits. Even in the simplest
setting of identifying the best arm, there remains a logarithmic multiplicative gap between the …

Lagre Referanse Sitert av 613 Beslektede artikler Alle 9 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

lil'ucb: An optimal exploration algorithm for multi-armed bandits

K Jamieson, M Malloy, R Nowak… - … on Learning Theory, 2014 - proceedings.mlr.press

The paper proposes a novel upper confidence bound (UCB) procedure for identifying the
arm with the largest mean in a multi-armed bandit game in the fixed confidence setting using …

Lagre Referanse Sitert av 500 Beslektede artikler Alle 13 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Best arm identification: A unified approach to fixed budget and fixed confidence

V Gabillon, M Ghavamzadeh… - Advances in neural …, 2012 - proceedings.neurips.cc

We study the problem of identifying the best arm (s) in the stochastic multi-armed bandit
setting. This problem has been studied in the literature from two different perspectives: fixed …

Lagre Referanse Sitert av 378 Beslektede artikler Alle 15 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Top two algorithms revisited

M Jourdan, R Degenne, D Baudry… - Advances in …, 2022 - proceedings.neurips.cc

Top two algorithms arose as an adaptation of Thompson sampling to best arm identification
in multi-armed bandit models for parametric families of arms. They select the next arm to …

Lagre Referanse Sitert av 54 Beslektede artikler Alle 14 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Combinatorial pure exploration of multi-armed bandits

S Chen, T Lin, I King, MR Lyu… - Advances in neural …, 2014 - proceedings.neurips.cc

We study the {\em combinatorial pure exploration (CPE)} problem in the stochastic multi-
armed bandit setting, where a learner explores a set of arms with the objective of identifying …

Lagre Referanse Sitert av 256 Beslektede artikler Alle 11 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

An optimal algorithm for the thresholding bandit problem

A Locatelli, M Gutzeit… - … Conference on Machine …, 2016 - proceedings.mlr.press

We study a specific combinatorial pure exploration stochastic bandit problem where the
learner aims at finding the set of arms whose means are above a given threshold, up to a …

Lagre Referanse Sitert av 176 Beslektede artikler Alle 13 versjoner HTML-versjon

Opprett varsel

Referanse

Avansert søk

Lagret i Mitt bibliotek

PAC subset selection in stochastic multi-armed bandits.

Best-arm identification algorithms for multi-armed bandits in the fixed confidence setting

Non-stochastic best arm identification and hyperparameter optimization

Optimal best arm identification with fixed confidence

[PDF][PDF] On the complexity of best-arm identification in multi-armed bandit models

Almost optimal exploration in multi-armed bandits

lil'ucb: An optimal exploration algorithm for multi-armed bandits

Best arm identification: A unified approach to fixed budget and fixed confidence

Top two algorithms revisited

Combinatorial pure exploration of multi-armed bandits

An optimal algorithm for the thresholding bandit problem