The Real Price of Bandit Information in Multiclass Classification

L Erez, A Cohen, T Koren, Y Mansour… - arXiv preprint arXiv …, 2024 - arxiv.org
We revisit the classical problem of multiclass classification with bandit feedback (Kakade,
Shalev-Shwartz and Tewari, 2008), where each input classifies to one of $K$ possible …

A Unified Framework for Bandit Online Multiclass Prediction

W Feng, X Gao, P Zhao, SCH Hoi - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Bandit online multiclass prediction plays an important role in many real-world applications.
In this paper, we propose a unified framework for it in the fully adversarial setting. This …

Apple Tasting: Combinatorial Dimensions and Minimax Rates

V Raman, U Subedi, A Raman… - Proceedings of Machine …, 2024 - ambujtewari.com
In online binary classification under apple tasting feedback, the learner only observes the
true label if it predicts “1”. First studied by Helmbold et al. (2000a), we revisit this classical …

Bandit Multiclass List Classification

L Erez, T Koren - arXiv preprint arXiv:2502.09257, 2025 - arxiv.org
We study the problem of multiclass list classification with (semi-)bandit feedback, where
input examples are mapped into subsets of size $m$ of a collection of $K$ possible …

Fast Rates for Bandit PAC Multiclass Classification

L Erez, A Cohen, T Koren, Y Mansour… - arXiv preprint arXiv …, 2024 - arxiv.org
We study multiclass PAC learning with bandit feedback, where inputs are classified into one
of $K$ possible labels and feedback is limited to whether or not the predicted labels are …

Strategic Littlestone Dimension: Improved Bounds on Online Strategic Classification

S Ahmadi, K Yang, H Zhang - arXiv preprint arXiv:2407.11619, 2024 - arxiv.org
We study the problem of online binary classification in settings where strategic agents can
modify their observable features to receive a positive classification. We model the set of …

Bandit-Feedback Online Multiclass Classification: Variants and Tradeoffs

Y Filmus, S Hanneke, I Mehalel, S Moran - arXiv preprint arXiv …, 2024 - arxiv.org
Consider the domain of multiclass classification within the adversarial online setting. What is
the price of relying on bandit feedback as opposed to full information? To what extent can an …