Recent advances in algorithmic high-dimensional robust statistics
Learning in the presence of outliers is a fundamental problem in statistics. Until recently, all
known efficient unsupervised learning algorithms were very sensitive to outliers in high …
known efficient unsupervised learning algorithms were very sensitive to outliers in high …
An overview of robust subspace recovery
This paper will serve as an introduction to the body of work on robust subspace recovery.
Robust subspace recovery involves finding an underlying low-dimensional subspace in a …
Robust subspace recovery involves finding an underlying low-dimensional subspace in a …
[PDF][PDF] Manipulating the byzantine: Optimizing model poisoning attacks and defenses for federated learning
Federated learning (FL) enables many data owners (eg, mobile devices) to train a joint ML
model (eg, a next-word prediction classifier) without the need of sharing their private training …
model (eg, a next-word prediction classifier) without the need of sharing their private training …
Spectral signatures in backdoor attacks
A recent line of work has uncovered a new form of data poisoning: so-called backdoor
attacks. These attacks are particularly dangerous because they do not affect a network's …
attacks. These attacks are particularly dangerous because they do not affect a network's …
Dataset security for machine learning: Data poisoning, backdoor attacks, and defenses
As machine learning systems grow in scale, so do their training data requirements, forcing
practitioners to automate and outsource the curation of training data in order to achieve state …
practitioners to automate and outsource the curation of training data in order to achieve state …
Robust estimators in high-dimensions without the computational intractability
We study high-dimensional distribution learning in an agnostic setting where an adversary is
allowed to arbitrarily corrupt an ε-fraction of the samples. Such questions have a rich history …
allowed to arbitrarily corrupt an ε-fraction of the samples. Such questions have a rich history …
Sever: A robust meta-algorithm for stochastic optimization
In high dimensions, most machine learning methods are brittle to even a small fraction of
structured outliers. To address this, we introduce a new meta-algorithm that can take in a …
structured outliers. To address this, we introduce a new meta-algorithm that can take in a …
Robust federated learning in a heterogeneous environment
We study a recently proposed large-scale distributed learning paradigm, namely Federated
Learning, where the worker machines are end users' own devices. Statistical and …
Learning, where the worker machines are end users' own devices. Statistical and …
Mean estimation and regression under heavy-tailed distributions: A survey
We survey some of the recent advances in mean estimation and regression function
estimation. In particular, we describe sub-Gaussian mean estimators for possibly heavy …
estimation. In particular, we describe sub-Gaussian mean estimators for possibly heavy …
A survey on heterogeneous federated learning
Federated learning (FL) has been proposed to protect data privacy and virtually assemble
the isolated data silos by cooperatively training models among organizations without …
the isolated data silos by cooperatively training models among organizations without …