A modern introduction to online learning

F Orabona - arxiv preprint arxiv:1912.13213, 2019 - arxiv.org
In this monograph, I introduce the basic concepts of Online Learning through a modern view
of Online Convex Optimization. Here, online learning refers to the framework of regret …

Estimating means of bounded random variables by betting

I Waudby-Smith, A Ramdas - Journal of the Royal Statistical …, 2024 - academic.oup.com
We derive confidence intervals (CIs) and confidence sequences (CSs) for the classical
problem of estimating a bounded mean. Our approach generalizes and improves on the …

Tight concentrations and confidence sequences from the regret of universal portfolio

F Orabona, KS Jun - IEEE Transactions on Information Theory, 2023 - ieeexplore.ieee.org
A classic problem in statistics is the estimation of the expectation of random variables from
samples. This gives rise to the tightly connected problems of deriving concentration …

Tighter PAC-Bayes bounds through coin-betting

K Jang, KS Jun, I Kuzborskij… - The Thirty Sixth Annual …, 2023 - proceedings.mlr.press
We consider the problem of estimating the mean of a sequence of random elements $ f
(\theta, X_1) $$,\ldots, $$ f (\theta, X_n) $ where $ f $ is a fixed scalar function …

Time-uniform self-normalized concentration for vector-valued processes

J Whitehouse, ZS Wu, A Ramdas - arxiv preprint arxiv:2310.09100, 2023 - arxiv.org
Self-normalized processes arise naturally in many statistical tasks. While self-normalized
concentration has been extensively studied for scalar-valued processes, there is less work …

Parameter-free regret in high probability with heavy tails

J Zhang, A Cutkosky - Advances in Neural Information …, 2022 - proceedings.neurips.cc
We present new algorithms for online convex optimization over unbounded domains that
obtain parameter-free regret in high-probability given access only to potentially heavy-tailed …

Online learning with imperfect hints

A Bhaskara, A Cutkosky, R Kumar… - … on Machine Learning, 2020 - proceedings.mlr.press
We consider a variant of the classical online linear optimization problem in which at every
step, the online player receives a “hint” vector before choosing the action for that round …

Auditing fairness by betting

B Chugg, S Cortes-Gomez, B Wilder… - Advances in Neural …, 2023 - proceedings.neurips.cc
We provide practical, efficient, and nonparametric methods for auditing the fairness of
deployed classification and regression models. Whereas previous work relies on a fixed …

Empirical Bernstein in smooth Banach spaces

D Martinez-Taboada, A Ramdas - arxiv preprint arxiv:2409.06060, 2024 - arxiv.org
Existing concentration bounds for bounded vector-valued random variables include
extensions of the scalar Hoeffding and Bernstein inequalities. While the latter is typically …

Improved regret bounds of (multinomial) logistic bandits via regret-to-confidence-set conversion

J Lee, SY Yun, KS Jun - International Conference on …, 2024 - proceedings.mlr.press
Logistic bandit is a ubiquitous framework of modeling users' choices, eg, click vs. no click for
advertisement recommender system. We observe that the prior works overlook or neglect …