- Academic Search

T Lattimore, C Szepesvári - 2020 - books.google.com

Decision-making in the face of uncertainty is a significant challenge in machine learning,
and the multi-armed bandit model is a commonly used framework to address it. This …

Save Cite Cited by 3277 Related articles All 9 versions Free GPT-4 Library Search

[Free GPT-4]

[PDF] jmlr.org

[PDF][PDF] On the complexity of best-arm identification in multi-armed bandit models

E Kaufmann, O Cappé, A Garivier - The Journal of Machine Learning …, 2016 - jmlr.org

The stochastic multi-armed bandit model is a simple abstraction that has proven useful in
many different contexts in statistics and machine learning. Whereas the achievable limit in …

Save Cite Cited by 653 Related articles All 14 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

Batched multi-armed bandits problem

Z Gao, Y Han, Z Ren, Z Zhou - Advances in Neural …, 2019 - proceedings.neurips.cc

In this paper, we study the multi-armed bandit problem in the batched setting where the
employed policy must split data into a small number of batches. While the minimax regret for …

Save Cite Cited by 171 Related articles All 15 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] projecteuclid.org

Batched bandit problems

V Perchet, P Rigollet, S Chassang, E Snowberg - 2016 - projecteuclid.org

Batched bandit problems Page 1 The Annals of Statistics 2016, Vol. 44, No. 2, 660–681 DOI:
10.1214/15-AOS1381 © Institute of Mathematical Statistics, 2016 BATCHED BANDIT …

Save Cite Cited by 277 Related articles All 26 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Explore first, exploit next: The true shape of regret in bandit problems

A Garivier, P Ménard, G Stoltz - Mathematics of Operations …, 2019 - pubsonline.informs.org

We revisit lower bounds on the regret in the case of multiarmed bandit problems. We obtain
nonasymptotic, distribution-dependent bounds and provide simple proofs based only on …

Save Cite Cited by 214 Related articles All 12 versions Free GPT-4

Learning unknown service rates in queues: A multiarmed bandit approach

S Krishnasamy, R Sen, R Johari… - Operations …, 2021 - pubsonline.informs.org

Consider a queueing system consisting of multiple servers. Jobs arrive over time and enter a
queue for service; the goal is to minimize the size of this queue. At each opportunity for …

Save Cite Cited by 67 Related articles All 5 versions Free GPT-4

[Free GPT-4]

[PDF] mlr.press

Beating stochastic and adversarial semi-bandits optimally and simultaneously

J Zimmert, H Luo, CY Wei - International Conference on …, 2019 - proceedings.mlr.press

We develop the first general semi-bandit algorithm that simultaneously achieves $\mathcal
{O}(\log T) $ regret for stochastic environments and $\mathcal {O}(\sqrt {T}) $ regret for …

Save Cite Cited by 96 Related articles All 9 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

On explore-then-commit strategies

A Garivier, T Lattimore… - Advances in Neural …, 2016 - proceedings.neurips.cc

We study the problem of minimising regret in two-armed bandit problems with Gaussian
rewards. Our objective is to use this simple setting to illustrate that strategies based on an …

Save Cite Cited by 136 Related articles All 14 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] ieee.org

Risk-averse multi-armed bandit problems under mean-variance measure

S Vakili, Q Zhao - IEEE Journal of Selected Topics in Signal …, 2016 - ieeexplore.ieee.org

The multi-armed bandit (MAB) problems have been studied mainly under the measure of
expected total reward accrued over a horizon of length T. In this paper, we address the issue …

Save Cite Cited by 111 Related articles All 4 versions Free GPT-4

[Free GPT-4]

[PDF] mlr.press

Online learning in repeated auctions

J Weed, V Perchet, P Rigollet - Conference on Learning …, 2016 - proceedings.mlr.press

Motivated by online advertising auctions, we consider repeated Vickrey auctions where
goods of unknown value are sold sequentially and bidders only learn (potentially noisy) …

Save Cite Cited by 109 Related articles All 15 versions Free GPT-4 View as HTML

Create alert

Cite

Advanced search

Saved to My library

Bounded regret in stochastic multi-armed bandits

[BOOK][B] Bandit algorithms

[PDF][PDF] On the complexity of best-arm identification in multi-armed bandit models

Batched multi-armed bandits problem

Batched bandit problems

Explore first, exploit next: The true shape of regret in bandit problems

Learning unknown service rates in queues: A multiarmed bandit approach

Beating stochastic and adversarial semi-bandits optimally and simultaneously

On explore-then-commit strategies

Risk-averse multi-armed bandit problems under mean-variance measure

Online learning in repeated auctions