[BOOK][B] Bandit algorithms
T Lattimore, C Szepesvári - 2020 - books.google.com
Decision-making in the face of uncertainty is a significant challenge in machine learning,
and the multi-armed bandit model is a commonly used framework to address it. This …
and the multi-armed bandit model is a commonly used framework to address it. This …
Byzantine-robust distributed learning: Towards optimal statistical rates
In this paper, we develop distributed optimization algorithms that are provably robust against
Byzantine failures—arbitrary and potentially adversarial behavior, in distributed computing …
Byzantine failures—arbitrary and potentially adversarial behavior, in distributed computing …
Regret analysis of stochastic and nonstochastic multi-armed bandit problems
Multi-armed bandit problems are the most basic examples of sequential decision problems
with an exploration-exploitation trade-off. This is the balance between staying with the option …
with an exploration-exploitation trade-off. This is the balance between staying with the option …
Trustworthy distributed ai systems: Robustness, privacy, and governance
Emerging Distributed AI systems are revolutionizing big data computing and data
processing capabilities with growing economic and societal impact. However, recent studies …
processing capabilities with growing economic and societal impact. However, recent studies …
Why are adaptive methods good for attention models?
While stochastic gradient descent (SGD) is still the de facto algorithm in deep learning,
adaptive methods like Clipped SGD/Adam have been observed to outperform SGD across …
adaptive methods like Clipped SGD/Adam have been observed to outperform SGD across …
Mean estimation and regression under heavy-tailed distributions: A survey
We survey some of the recent advances in mean estimation and regression function
estimation. In particular, we describe sub-Gaussian mean estimators for possibly heavy …
estimation. In particular, we describe sub-Gaussian mean estimators for possibly heavy …
Stochastic bandits robust to adversarial corruptions
We introduce a new model of stochastic bandits with adversarial corruptions which aims to
capture settings where most of the input follows a stochastic pattern but some fraction of it …
capture settings where most of the input follows a stochastic pattern but some fraction of it …
Fast federated learning in the presence of arbitrary device unavailability
Federated learning (FL) coordinates with numerous heterogeneous devices to
collaboratively train a shared model while preserving user privacy. Despite its multiple …
collaboratively train a shared model while preserving user privacy. Despite its multiple …
Better algorithms for stochastic bandits with adversarial corruptions
We study the stochastic multi-armed bandits problem in the presence of adversarial
corruption. We present a new algorithm for this problem whose regret is nearly optimal …
corruption. We present a new algorithm for this problem whose regret is nearly optimal …
Geometric median and robust estimation in Banach spaces
S Minsker - 2015 - projecteuclid.org
In many real-world applications, collected data are contaminated by noise with heavy-tailed
distribution and might contain outliers of large magnitude. In this situation, it is necessary to …
distribution and might contain outliers of large magnitude. In this situation, it is necessary to …