Google Академія

F Orabona - arxiv preprint arxiv:1912.13213, 2019 - arxiv.org

In this monograph, I introduce the basic concepts of Online Learning through a modern view
of Online Convex Optimization. Here, online learning refers to the framework of regret …

Зберегти Послатися Цитовано в 439 джерелах Пов’язані статті Кількість версій: 3 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Estimating means of bounded random variables by betting

I Waudby-Smith, A Ramdas - Journal of the Royal Statistical …, 2024 - academic.oup.com

We derive confidence intervals (CIs) and confidence sequences (CSs) for the classical
problem of estimating a bounded mean. Our approach generalizes and improves on the …

Зберегти Послатися Цитовано в 200 джерелах Пов’язані статті Кількість версій: 10

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

Tight concentrations and confidence sequences from the regret of universal portfolio

F Orabona, KS Jun - IEEE Transactions on Information Theory, 2023 - ieeexplore.ieee.org

A classic problem in statistics is the estimation of the expectation of random variables from
samples. This gives rise to the tightly connected problems of deriving concentration …

Зберегти Послатися Цитовано в 80 джерелах Пов’язані статті Кількість версій: 10

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Tighter PAC-Bayes bounds through coin-betting

K Jang, KS Jun, I Kuzborskij… - The Thirty Sixth Annual …, 2023 - proceedings.mlr.press

We consider the problem of estimating the mean of a sequence of random elements $ f
(\theta, X_1) $$,\ldots, $$ f (\theta, X_n) $ where $ f $ is a fixed scalar function …

Зберегти Послатися Цитовано в 31 джерелах Пов’язані статті Кількість версій: 5 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Time-uniform self-normalized concentration for vector-valued processes

J Whitehouse, ZS Wu, A Ramdas - arxiv preprint arxiv:2310.09100, 2023 - arxiv.org

Self-normalized processes arise naturally in many statistical tasks. While self-normalized
concentration has been extensively studied for scalar-valued processes, there is less work …

Зберегти Послатися Цитовано в 22 джерелах Пов’язані статті Кількість версій: 3 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Parameter-free regret in high probability with heavy tails

J Zhang, A Cutkosky - Advances in Neural Information …, 2022 - proceedings.neurips.cc

We present new algorithms for online convex optimization over unbounded domains that
obtain parameter-free regret in high-probability given access only to potentially heavy-tailed …

Зберегти Послатися Цитовано в 21 джерелах Пов’язані статті Кількість версій: 7 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Online learning with imperfect hints

A Bhaskara, A Cutkosky, R Kumar… - … on Machine Learning, 2020 - proceedings.mlr.press

We consider a variant of the classical online linear optimization problem in which at every
step, the online player receives a “hint” vector before choosing the action for that round …

Зберегти Послатися Цитовано в 65 джерелах Пов’язані статті Кількість версій: 6 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Auditing fairness by betting

B Chugg, S Cortes-Gomez, B Wilder… - Advances in Neural …, 2023 - proceedings.neurips.cc

We provide practical, efficient, and nonparametric methods for auditing the fairness of
deployed classification and regression models. Whereas previous work relies on a fixed …

Зберегти Послатися Цитовано в 7 джерелах Пов’язані статті Кількість версій: 6 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Empirical Bernstein in smooth Banach spaces

D Martinez-Taboada, A Ramdas - arxiv preprint arxiv:2409.06060, 2024 - arxiv.org

Existing concentration bounds for bounded vector-valued random variables include
extensions of the scalar Hoeffding and Bernstein inequalities. While the latter is typically …

Зберегти Послатися Цитовано в 8 джерелах Пов’язані статті Кількість версій: 2 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Improved regret bounds of (multinomial) logistic bandits via regret-to-confidence-set conversion

J Lee, SY Yun, KS Jun - International Conference on …, 2024 - proceedings.mlr.press

Logistic bandit is a ubiquitous framework of modeling users' choices, eg, click vs. no click for
advertisement recommender system. We observe that the prior works overlook or neglect …

Зберегти Послатися Цитовано в 11 джерелах Пов’язані статті Кількість версій: 7 Показати у форматі HTML

Створити сповіщення

Послатися

Розширений пошук

Збережено в моїй бібліотеці

Parameter-free online convex optimization with sub-exponential noise

A modern introduction to online learning

Estimating means of bounded random variables by betting

Tight concentrations and confidence sequences from the regret of universal portfolio

Tighter PAC-Bayes bounds through coin-betting

Time-uniform self-normalized concentration for vector-valued processes

Parameter-free regret in high probability with heavy tails

Online learning with imperfect hints

Auditing fairness by betting

Empirical Bernstein in smooth Banach spaces

Improved regret bounds of (multinomial) logistic bandits via regret-to-confidence-set conversion