- Academic Search

LJ Hong, W Fan, J Luo - Frontiers of Engineering Management, 2021 - Springer

In this paper, we briefly review the development of ranking and selection (R&S) in the past
70 years, especially the theoretical achievements and practical applications in the past 20 …

Save Cite Cited by 125 Related articles All 12 versions Free GPT-4

[Free GPT-4]

[PDF] jmlr.org

Hyperband: A novel bandit-based approach to hyperparameter optimization

L Li, K Jamieson, G DeSalvo, A Rostamizadeh… - Journal of Machine …, 2018 - jmlr.org

Performance of machine learning algorithms depends critically on identifying a good set of
hyperparameters. While recent approaches use Bayesian optimization to adaptively select …

Save Cite Cited by 3163 Related articles All 13 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] washington.edu

Best-arm identification algorithms for multi-armed bandits in the fixed confidence setting

K Jamieson, R Nowak - 2014 48th annual conference on …, 2014 - ieeexplore.ieee.org

This paper is concerned with identifying the arm with the highest mean in a multi-armed
bandit problem using as few independent samples from the arms as possible. While the so …

Save Cite Cited by 251 Related articles All 6 versions Free GPT-4

[Free GPT-4]

[PDF] mlr.press

Non-stochastic best arm identification and hyperparameter optimization

K Jamieson, A Talwalkar - Artificial intelligence and statistics, 2016 - proceedings.mlr.press

Motivated by the task of hyperparameter optimization, we introduce the\em non-stochastic
best-arm identification problem. We identify an attractive algorithm for this setting that makes …

Save Cite Cited by 775 Related articles All 8 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] jmlr.org

[PDF][PDF] On the complexity of best-arm identification in multi-armed bandit models

E Kaufmann, O Cappé, A Garivier - The Journal of Machine Learning …, 2016 - jmlr.org

The stochastic multi-armed bandit model is a simple abstraction that has proven useful in
many different contexts in statistics and machine learning. Whereas the achievable limit in …

Save Cite Cited by 656 Related articles All 14 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] projecteuclid.org

Game-theoretic statistics and safe anytime-valid inference

A Ramdas, P Grünwald, V Vovk, G Shafer - Statistical Science, 2023 - projecteuclid.org

Safe anytime-valid inference (SAVI) provides measures of statistical evidence and certainty—
e-processes for testing and confidence sequences for estimation—that remain valid at all …

Save Cite Cited by 151 Related articles All 12 versions Free GPT-4

[Free GPT-4]

[PDF] projecteuclid.org

Time-uniform, nonparametric, nonasymptotic confidence sequences

SR Howard, A Ramdas, J McAuliffe, J Sekhon - 2021 - projecteuclid.org

Time-uniform, nonparametric, nonasymptotic confidence sequences Page 1 The Annals of
Statistics 2021, Vol. 49, No. 2, 1055–1080 https://doi.org/10.1214/20-AOS1991 © Institute of …

Save Cite Cited by 320 Related articles All 7 versions Free GPT-4

[Free GPT-4]

[PDF] mlr.press

Optimal best arm identification with fixed confidence

A Garivier, E Kaufmann - Conference on Learning Theory, 2016 - proceedings.mlr.press

We give a complete characterization of the complexity of best-arm identification in one-
parameter bandit problems. We prove a new, tight lower bound on the sample complexity …

Save Cite Cited by 432 Related articles All 16 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

Top two algorithms revisited

M Jourdan, R Degenne, D Baudry… - Advances in …, 2022 - proceedings.neurips.cc

Top two algorithms arose as an adaptation of Thompson sampling to best arm identification
in multi-armed bandit models for parametric families of arms. They select the next arm to …

Save Cite Cited by 52 Related articles All 12 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] acm.org Full View

Anytime-valid off-policy inference for contextual bandits

I Waudby-Smith, L Wu, A Ramdas… - ACM/JMS Journal of …, 2024 - dl.acm.org

Contextual bandit algorithms are ubiquitous tools for active sequential experimentation in
healthcare and the tech industry. They involve online learning algorithms that adaptively …

Save Cite Cited by 30 Related articles All 5 versions Free GPT-4

Create alert

Cite

Advanced search

Saved to My library

lil’ucb: An optimal exploration algorithm for multi-armed bandits

Review on ranking and selection: A new perspective

Hyperband: A novel bandit-based approach to hyperparameter optimization

Best-arm identification algorithms for multi-armed bandits in the fixed confidence setting

Non-stochastic best arm identification and hyperparameter optimization

[PDF][PDF] On the complexity of best-arm identification in multi-armed bandit models

Game-theoretic statistics and safe anytime-valid inference

Time-uniform, nonparametric, nonasymptotic confidence sequences

Optimal best arm identification with fixed confidence

Top two algorithms revisited

Anytime-valid off-policy inference for contextual bandits