Študovňa Google

DJ Foster, SM Kakade, J Qian, A Rakhlin - arxiv preprint arxiv:2112.13487, 2021 - arxiv.org

A fundamental challenge in interactive learning and decision making, ranging from bandit
problems to reinforcement learning, is to provide sample-efficient, adaptive learning …

Uložiť Citovať Citované 220-krát Súvisiace články Všetky verzie 5 HTML verzia

[Free GPT-4]
[DeepSeek]

[PDF] tor-lattimore.com

[KNIHA][B] Bandit algorithms

T Lattimore, C Szepesvári - 2020 - books.google.com

Decision-making in the face of uncertainty is a significant challenge in machine learning,
and the multi-armed bandit model is a commonly used framework to address it. This …

Uložiť Citovať Citované 3356-krát Súvisiace články Všetky verzie 9 Vyhľadávanie knižnice

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Derivative-free optimization methods

J Larson, M Menickelly, SM Wild - Acta Numerica, 2019 - cambridge.org

In many optimization problems arising from scientific, engineering and artificial intelligence
applications, objective and constraint functions are available only as the output of a black …

Uložiť Citovať Citované 525-krát Súvisiace články Všetky verzie 12

[Free GPT-4]
[DeepSeek]

[PDF] nowpublishers.com

Introduction to multi-armed bandits

A Slivkins - Foundations and Trends® in Machine Learning, 2019 - nowpublishers.com

Multi-armed bandits a simple but very powerful framework for algorithms that make
decisions over time under uncertainty. An enormous body of work has accumulated over the …

Uložiť Citovať Citované 1272-krát Súvisiace články Všetky verzie 7 Vyhľadávanie knižnice HTML verzia

[Free GPT-4]
[DeepSeek]

[PDF] nowpublishers.com

Introduction to online convex optimization

E Hazan - Foundations and Trends® in Optimization, 2016 - nowpublishers.com

This monograph portrays optimization as a process. In many practical applications the
environment is so complex that it is infeasible to lay out a comprehensive theoretical model …

Uložiť Citovať Citované 2219-krát Súvisiace články Všetky verzie 19 Vyhľadávanie knižnice HTML verzia

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Strategic classification from revealed preferences

J Dong, A Roth, Z Schutzman, B Waggoner… - Proceedings of the 2018 …, 2018 - dl.acm.org

We study an online linear classification problem in which the data is generated by strategic
agents who manipulate their features in an effort to change the classification outcome. In …

Uložiť Citovať Citované 222-krát Súvisiace články Všetky verzie 7

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

More adaptive algorithms for adversarial bandits

CY Wei, H Luo - Conference On Learning Theory, 2018 - proceedings.mlr.press

We develop a novel and generic algorithm for the adversarial multi-armed bandit problem
(or more generally the combinatorial semi-bandit problem). When instantiated differently, our …

Uložiť Citovať Citované 191-krát Súvisiace články Všetky verzie 6 HTML verzia

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Corralling a band of bandit algorithms

A Agarwal, H Luo, B Neyshabur… - … on Learning Theory, 2017 - proceedings.mlr.press

We study the problem of combining multiple bandit algorithms (that is, online learning
algorithms with partial feedback) with the goal of creating a master algorithm that performs …

Uložiť Citovať Citované 197-krát Súvisiace články Všetky verzie 6 HTML verzia

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Tight guarantees for interactive decision making with the decision-estimation coefficient

DJ Foster, N Golowich, Y Han - The Thirty Sixth Annual …, 2023 - proceedings.mlr.press

A foundational problem in reinforcement learning and interactive decision making is to
understand what modeling assumptions lead to sample-efficient learning guarantees, and …

Uložiť Citovať Citované 41-krát Súvisiace články Všetky verzie 4 HTML verzia

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Adversarial bandits with knapsacks

N Immorlica, K Sankararaman, R Schapire… - Journal of the ACM, 2022 - dl.acm.org

We consider Bandits with Knapsacks (henceforth, BwK), a general model for multi-armed
bandits under supply/budget constraints. In particular, a bandit algorithm needs to solve a …

Uložiť Citovať Citované 140-krát Súvisiace články Všetky verzie 12

Vytvoriť upozornenie

Citovať

Rozšírené vyhľadávanie

Uložené do mojej knižnice

Kernel-based methods for bandit convex optimization

The statistical complexity of interactive decision making

[KNIHA][B] Bandit algorithms

Derivative-free optimization methods

Introduction to multi-armed bandits

Introduction to online convex optimization

Strategic classification from revealed preferences

More adaptive algorithms for adversarial bandits

Corralling a band of bandit algorithms

Tight guarantees for interactive decision making with the decision-estimation coefficient

Adversarial bandits with knapsacks