- Academic Search

B Shahriari, K Swersky, Z Wang… - Proceedings of the …, 2015 - ieeexplore.ieee.org

Big Data applications are typically associated with systems involving large numbers of
users, massive complex software systems, and large-scale heterogeneous computing and …

Save Cite Cited by 6019 Related articles All 14 versions Free GPT-4

[Free GPT-4]

[PDF] princeton.edu

A unified framework for stochastic optimization

WB Powell - European Journal of Operational Research, 2019 - Elsevier

Stochastic optimization is an umbrella term that includes over a dozen fragmented
communities, using a patchwork of sometimes overlap** notational systems with …

Save Cite Cited by 357 Related articles All 4 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Causal machine learning: A survey and open problems

J Kaddour, A Lynch, Q Liu, MJ Kusner… - arxiv preprint arxiv …, 2022 - arxiv.org

Causal Machine Learning (CausalML) is an umbrella term for machine learning methods
that formalize the data-generation process as a structural causal model (SCM). This …

Save Cite Cited by 176 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] mlr.press

Non-stochastic best arm identification and hyperparameter optimization

K Jamieson, A Talwalkar - Artificial intelligence and statistics, 2016 - proceedings.mlr.press

Motivated by the task of hyperparameter optimization, we introduce the\em non-stochastic
best-arm identification problem. We identify an attractive algorithm for this setting that makes …

Save Cite Cited by 773 Related articles All 8 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] tandfonline.com

[BOOK][B] Reinforcement Learning and Stochastic Optimization: A Unified Framework for Sequential Decisions: by Warren B. Powell (ed.), Wiley (2022). Hardback. ISBN …

I Halperin - 2022 - Taylor & Francis

What is reinforcement learning? How is reinforcement learning different from stochastic
optimization? And finally, can it be used for applications to quantitative finance for my current …

Save Cite Cited by 216 Related articles All 6 versions Free GPT-4 Library Search

[Free GPT-4]

[PDF] jmlr.org

[PDF][PDF] On the complexity of best-arm identification in multi-armed bandit models

E Kaufmann, O Cappé, A Garivier - The Journal of Machine Learning …, 2016 - jmlr.org

The stochastic multi-armed bandit model is a simple abstraction that has proven useful in
many different contexts in statistics and machine learning. Whereas the achievable limit in …

Save Cite Cited by 653 Related articles All 14 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] mlr.press

Almost optimal exploration in multi-armed bandits

Z Karnin, T Koren, O Somekh - International conference on …, 2013 - proceedings.mlr.press

We study the problem of exploration in stochastic Multi-Armed Bandits. Even in the simplest
setting of identifying the best arm, there remains a logarithmic multiplicative gap between the …

Save Cite Cited by 612 Related articles All 9 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] mlr.press

Optimal best arm identification with fixed confidence

A Garivier, E Kaufmann - Conference on Learning Theory, 2016 - proceedings.mlr.press

We give a complete characterization of the complexity of best-arm identification in one-
parameter bandit problems. We prove a new, tight lower bound on the sample complexity …

Save Cite Cited by 429 Related articles All 16 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] mlr.press

lil'ucb: An optimal exploration algorithm for multi-armed bandits

K Jamieson, M Malloy, R Nowak… - … on Learning Theory, 2014 - proceedings.mlr.press

The paper proposes a novel upper confidence bound (UCB) procedure for identifying the
arm with the largest mean in a multi-armed bandit game in the fixed confidence setting using …

Save Cite Cited by 502 Related articles All 14 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] mlr.press

Learning to learn without gradient descent by gradient descent

Y Chen, MW Hoffman… - International …, 2017 - proceedings.mlr.press

We learn recurrent neural network optimizers trained on simple synthetic functions by
gradient descent. We show that these learned optimizers exhibit a remarkable degree of …

Save Cite Cited by 322 Related articles All 7 versions Free GPT-4 View as HTML

Create alert

Cite

Advanced search

Saved to My library

Best arm identification: A unified approach to fixed budget and fixed confidence

Taking the human out of the loop: A review of Bayesian optimization

A unified framework for stochastic optimization

Causal machine learning: A survey and open problems

Non-stochastic best arm identification and hyperparameter optimization

[BOOK][B] Reinforcement Learning and Stochastic Optimization: A Unified Framework for Sequential Decisions: by Warren B. Powell (ed.), Wiley (2022). Hardback. ISBN …

[PDF][PDF] On the complexity of best-arm identification in multi-armed bandit models

Almost optimal exploration in multi-armed bandits

Optimal best arm identification with fixed confidence

lil'ucb: An optimal exploration algorithm for multi-armed bandits

Learning to learn without gradient descent by gradient descent