[BOOK][B] Bandit algorithms
T Lattimore, C Szepesvári - 2020 - books.google.com
Decision-making in the face of uncertainty is a significant challenge in machine learning,
and the multi-armed bandit model is a commonly used framework to address it. This …
Introduction to multi-armed bandits
A Slivkins - Foundations and Trends® in Machine Learning, 2019 - nowpublishers.com
Multi-armed bandits are a simple but very powerful framework for algorithms that make
decisions over time under uncertainty. An enormous body of work has accumulated over the …
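The surveys above describe the core bandit loop: repeatedly choose an arm, observe only that arm's reward, and balance exploration against exploitation. A minimal sketch of one classical strategy, UCB1, is shown below; the Bernoulli-arm setup and all names are illustrative, not taken from any of these works.

```python
import math
import random

def ucb1(reward_fns, horizon, seed=0):
    """Minimal UCB1 sketch: pull each arm once, then pick the arm with the
    highest empirical mean plus a confidence bonus (exploration term)."""
    rng = random.Random(seed)
    k = len(reward_fns)
    counts = [0] * k          # pulls per arm
    sums = [0.0] * k          # cumulative reward per arm
    total = 0.0
    for t in range(1, horizon + 1):
        if t <= k:
            arm = t - 1       # initialisation: try every arm once
        else:
            arm = max(range(k), key=lambda a: sums[a] / counts[a]
                      + math.sqrt(2 * math.log(t) / counts[a]))
        r = reward_fns[arm](rng)   # only the chosen arm's reward is observed
        counts[arm] += 1
        sums[arm] += r
        total += r
    return counts, total

# Two Bernoulli arms with means 0.3 and 0.7; UCB1 should favour the second.
arms = [lambda rng: 1.0 if rng.random() < 0.3 else 0.0,
        lambda rng: 1.0 if rng.random() < 0.7 else 0.0]
counts, total = ucb1(arms, horizon=2000)
```

The confidence bonus shrinks as an arm is pulled more often, so exploration of apparently bad arms tapers off over time.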
Bandits with knapsacks
Multi-armed bandit problems are the predominant theoretical model of exploration-
exploitation tradeoffs in learning, and they have countless applications ranging from medical …
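The BwK model extends the bandit loop so that each pull also consumes resources from a fixed budget, and the process stops when the budget runs out. Below is a rough sketch of that setting using a naive epsilon-greedy rule, not the LP-based algorithms the paper actually analyzes; all names and the (reward, cost) arm interface are illustrative.

```python
import random

def budgeted_eps_greedy(arms, budget, eps=0.1, seed=0):
    """Epsilon-greedy over arms that yield (reward, cost) pairs.
    The loop stops once the budget is exhausted. This is only a sketch
    of the BwK setting, not the paper's algorithm."""
    rng = random.Random(seed)
    k = len(arms)
    counts = [0] * k
    mean_reward = [0.0] * k
    total_reward, spent = 0.0, 0.0
    while spent < budget:
        if rng.random() < eps or 0 in counts:
            arm = rng.randrange(k)                             # explore
        else:
            arm = max(range(k), key=lambda a: mean_reward[a])  # exploit
        reward, cost = arms[arm](rng)
        if spent + cost > budget:   # this pull would exceed the budget
            break
        spent += cost
        counts[arm] += 1
        mean_reward[arm] += (reward - mean_reward[arm]) / counts[arm]
        total_reward += reward
    return total_reward, spent

# Two deterministic arms with unit cost; the second is strictly worse.
arms = [lambda rng: (1.0, 1.0), lambda rng: (0.2, 1.0)]
total_reward, spent = budgeted_eps_greedy(arms, budget=100.0)
```

The key difference from the unconstrained loop is the stopping rule: performance is measured against the best policy achievable within the budget, not over a fixed horizon.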
Online task assignment in crowdsourcing markets
We explore the problem of assigning heterogeneous tasks to workers with different,
unknown skill sets in crowdsourcing markets such as Amazon Mechanical Turk. We first …
Truthful incentives in crowdsourcing tasks using regret minimization mechanisms
What price should be offered to a worker for a task in an online labor market? How can one
enable workers to express the amount they desire to receive for the task completion …
Bandits with concave rewards and convex knapsacks
In this paper, we consider a very general model for exploration-exploitation tradeoff which
allows arbitrary concave rewards and convex constraints on the decisions across time, in …
Adversarial bandits with knapsacks
We consider Bandits with Knapsacks (henceforth, BwK), a general model for multi-armed
bandits under supply/budget constraints. In particular, a bandit algorithm needs to solve a …
bandits under supply/budget constraints. In particular, a bandit algorithm needs to solve a …
Linear contextual bandits with knapsacks
We consider the linear contextual bandit problem with resource consumption, in addition to
reward generation. In each round, the outcome of pulling an arm is a reward as well as a …
Online learning with knapsacks: the best of both worlds
We study online learning problems in which a decision maker wants to maximize their
expected reward without violating a finite set of $ m $ resource constraints. By casting the …
Knapsack based optimal policies for budget-limited multi-armed bandits
In budget-limited multi-armed bandit (MAB) problems, the learner's actions are costly and
constrained by a fixed budget. Consequently, an optimal exploitation policy may not be …
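A central point in this line of work is that when arms have different costs, a good policy maximizes reward per unit of budget rather than raw mean reward. The sketch below is an epsilon-first scheme in that spirit: a fixed exploration budget spent round-robin, then greedy exploitation by estimated reward/cost density. It is a greedy stand-in for the paper's knapsack-based exploitation step, and all names are illustrative.

```python
import random

def eps_first_knapsack(arms, costs, budget, eps=0.25, seed=0):
    """Epsilon-first sketch: spend eps*budget exploring arms round-robin,
    then spend the rest on the arm with the best estimated reward/cost
    density (a greedy proxy for a knapsack-based exploitation policy)."""
    rng = random.Random(seed)
    k = len(arms)
    counts = [0] * k
    means = [0.0] * k
    spent, total = 0.0, 0.0
    # Exploration phase: round-robin until eps*budget is used up.
    step = 0
    while spent + costs[step % k] <= eps * budget:
        a = step % k
        r = arms[a](rng)
        counts[a] += 1
        means[a] += (r - means[a]) / counts[a]
        spent += costs[a]
        total += r
        step += 1
    # Exploitation phase: repeatedly pull the highest-density arm.
    best = max(range(k), key=lambda a: means[a] / costs[a])
    while spent + costs[best] <= budget:
        total += arms[best](rng)
        spent += costs[best]
    return total, spent

# Deterministic arms: arm 0 has lower mean reward but higher density
# (1.0 per unit cost vs 0.5), so it should be exploited.
arms = [lambda rng: 1.0, lambda rng: 1.5]
total, spent = eps_first_knapsack(arms, costs=[1.0, 3.0], budget=100.0)
```

Note that the density ordering picks arm 0 here even though arm 1 has the higher mean reward, which is exactly the sense in which the classical "pull the best-mean arm" policy stops being optimal under a budget.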