Meta-learning adversarial bandit algorithms

M Khodak, I Osadchiy, K Harris… - Advances in …, 2023 - proceedings.neurips.cc
We study online meta-learning with bandit feedback, with the goal of improving performance
across multiple tasks if they are similar according to some natural similarity measure. As the …

Feature and parameter selection in stochastic linear bandits

A Moradipari, B Turan… - International …, 2022 - proceedings.mlr.press
We study two model selection settings in stochastic linear bandits (LB). In the first setting,
which we refer to as feature selection, the expected reward of the LB problem is in the linear …

[หนังสือ][B] Learning and Pricing Algorithms for Human-Cyber-Physical Systems

A Moradipari - 2022 - search.proquest.com
Nowadays with the growth of large-scale societal infrastructure systems, there has been
significant research attention on improving efficiency, guaranteeing safety, reducing …