Google Scholar

Results: 2 (0.01 s)

Nearest neighbour with bandit feedback

[PDF] arxiv.org

Bandits with Abstention under Expert Advice

S Pasteris, A Rumi, M Thiessen, S Saito… - arXiv preprint arXiv …, 2024 - arxiv.org

We study the classic problem of prediction with expert advice under bandit feedback. Our
model assumes that one action, corresponding to the learner's abstention from play, has no …

Cited by 1, all 5 versions

[PDF] arxiv.org

A hierarchical nearest neighbour approach to contextual bandits

S Pasteris, C Hicks, V Mavroudis - arXiv preprint arXiv:2312.09332, 2023 - arxiv.org

In this paper we consider the adversarial contextual bandit problem in metric spaces. The
paper "Nearest neighbour with bandit feedback" tackled this problem but when there are …

Cited by 1, all 2 versions
