[BOOK][B] Bandit algorithms

T Lattimore, C Szepesvári - 2020 - books.google.com
Decision-making in the face of uncertainty is a significant challenge in machine learning,
and the multi-armed bandit model is a commonly used framework to address it. This …

Byzantine-robust distributed learning: Towards optimal statistical rates

D Yin, Y Chen, R Kannan… - … conference on machine …, 2018 - proceedings.mlr.press
In this paper, we develop distributed optimization algorithms that are provably robust against
Byzantine failures—arbitrary and potentially adversarial behavior, in distributed computing …

Regret analysis of stochastic and nonstochastic multi-armed bandit problems

S Bubeck, N Cesa-Bianchi - Foundations and Trends® in …, 2012 - nowpublishers.com
Multi-armed bandit problems are the most basic examples of sequential decision problems
with an exploration-exploitation trade-off. This is the balance between staying with the option …

Trustworthy distributed ai systems: Robustness, privacy, and governance

W Wei, L Liu - ACM Computing Surveys, 2024 - dl.acm.org
Emerging Distributed AI systems are revolutionizing big data computing and data
processing capabilities with growing economic and societal impact. However, recent studies …

Why are adaptive methods good for attention models?

J Zhang, SP Karimireddy, A Veit… - Advances in …, 2020 - proceedings.neurips.cc
While stochastic gradient descent (SGD) is still the de facto algorithm in deep learning,
adaptive methods like Clipped SGD/Adam have been observed to outperform SGD across …

Mean estimation and regression under heavy-tailed distributions: A survey

G Lugosi, S Mendelson - Foundations of Computational Mathematics, 2019 - Springer
We survey some of the recent advances in mean estimation and regression function
estimation. In particular, we describe sub-Gaussian mean estimators for possibly heavy …

Stochastic bandits robust to adversarial corruptions

T Lykouris, V Mirrokni, R Paes Leme - … of the 50th Annual ACM SIGACT …, 2018 - dl.acm.org
We introduce a new model of stochastic bandits with adversarial corruptions which aims to
capture settings where most of the input follows a stochastic pattern but some fraction of it …

Fast federated learning in the presence of arbitrary device unavailability

X Gu, K Huang, J Zhang… - Advances in Neural …, 2021 - proceedings.neurips.cc
Federated learning (FL) coordinates with numerous heterogeneous devices to
collaboratively train a shared model while preserving user privacy. Despite its multiple …

Better algorithms for stochastic bandits with adversarial corruptions

A Gupta, T Koren, K Talwar - Conference on Learning …, 2019 - proceedings.mlr.press
We study the stochastic multi-armed bandits problem in the presence of adversarial
corruption. We present a new algorithm for this problem whose regret is nearly optimal …

Geometric median and robust estimation in Banach spaces

S Minsker - 2015 - projecteuclid.org
In many real-world applications, collected data are contaminated by noise with heavy-tailed
distribution and might contain outliers of large magnitude. In this situation, it is necessary to …