Taking the human out of the loop: A review of Bayesian optimization
Big Data applications are typically associated with systems involving large numbers of
users, massive complex software systems, and large-scale heterogeneous computing and …
users, massive complex software systems, and large-scale heterogeneous computing and …
A unified framework for stochastic optimization
WB Powell - European Journal of Operational Research, 2019 - Elsevier
Stochastic optimization is an umbrella term that includes over a dozen fragmented
communities, using a patchwork of sometimes overlap** notational systems with …
communities, using a patchwork of sometimes overlap** notational systems with …
Causal machine learning: A survey and open problems
Causal Machine Learning (CausalML) is an umbrella term for machine learning methods
that formalize the data-generation process as a structural causal model (SCM). This …
that formalize the data-generation process as a structural causal model (SCM). This …
Non-stochastic best arm identification and hyperparameter optimization
Motivated by the task of hyperparameter optimization, we introduce the\em non-stochastic
best-arm identification problem. We identify an attractive algorithm for this setting that makes …
best-arm identification problem. We identify an attractive algorithm for this setting that makes …
[BOOK][B] Reinforcement Learning and Stochastic Optimization: A Unified Framework for Sequential Decisions: by Warren B. Powell (ed.), Wiley (2022). Hardback. ISBN …
I Halperin - 2022 - Taylor & Francis
What is reinforcement learning? How is reinforcement learning different from stochastic
optimization? And finally, can it be used for applications to quantitative finance for my current …
optimization? And finally, can it be used for applications to quantitative finance for my current …
[PDF][PDF] On the complexity of best-arm identification in multi-armed bandit models
The stochastic multi-armed bandit model is a simple abstraction that has proven useful in
many different contexts in statistics and machine learning. Whereas the achievable limit in …
many different contexts in statistics and machine learning. Whereas the achievable limit in …
Almost optimal exploration in multi-armed bandits
We study the problem of exploration in stochastic Multi-Armed Bandits. Even in the simplest
setting of identifying the best arm, there remains a logarithmic multiplicative gap between the …
setting of identifying the best arm, there remains a logarithmic multiplicative gap between the …
Optimal best arm identification with fixed confidence
We give a complete characterization of the complexity of best-arm identification in one-
parameter bandit problems. We prove a new, tight lower bound on the sample complexity …
parameter bandit problems. We prove a new, tight lower bound on the sample complexity …
lil'ucb: An optimal exploration algorithm for multi-armed bandits
The paper proposes a novel upper confidence bound (UCB) procedure for identifying the
arm with the largest mean in a multi-armed bandit game in the fixed confidence setting using …
arm with the largest mean in a multi-armed bandit game in the fixed confidence setting using …
Learning to learn without gradient descent by gradient descent
We learn recurrent neural network optimizers trained on simple synthetic functions by
gradient descent. We show that these learned optimizers exhibit a remarkable degree of …
gradient descent. We show that these learned optimizers exhibit a remarkable degree of …