Byzantine machine learning: A primer
The problem of Byzantine resilience in distributed machine learning, aka Byzantine machine
learning, consists of designing distributed algorithms that can train an accurate model …
learning, consists of designing distributed algorithms that can train an accurate model …
Recent advances in algorithmic high-dimensional robust statistics
Learning in the presence of outliers is a fundamental problem in statistics. Until recently, all
known efficient unsupervised learning algorithms were very sensitive to outliers in high …
known efficient unsupervised learning algorithms were very sensitive to outliers in high …
Data collection and quality challenges in deep learning: A data-centric ai perspective
Data-centric AI is at the center of a fundamental shift in software engineering where machine
learning becomes the new software, powered by big data and computing infrastructure …
learning becomes the new software, powered by big data and computing infrastructure …
Teaser: Fast and certifiable point cloud registration
We propose the first fast and certifiable algorithm for the registration of two sets of three-
dimensional (3-D) points in the presence of large amounts of outlier correspondences. A …
dimensional (3-D) points in the presence of large amounts of outlier correspondences. A …
Remember what you want to forget: Algorithms for machine unlearning
We study the problem of unlearning datapoints from a learnt model. The learner first
receives a dataset $ S $ drawn iid from an unknown distribution, and outputs a model …
receives a dataset $ S $ drawn iid from an unknown distribution, and outputs a model …
Spectral signatures in backdoor attacks
A recent line of work has uncovered a new form of data poisoning: so-called backdoor
attacks. These attacks are particularly dangerous because they do not affect a network's …
attacks. These attacks are particularly dangerous because they do not affect a network's …
Robust aggregation for federated learning
We present a novel approach to federated learning that endows its aggregation process with
greater robustness to potential poisoning of local data or model parameters of participating …
greater robustness to potential poisoning of local data or model parameters of participating …
Byzantine-robust distributed learning: Towards optimal statistical rates
In this paper, we develop distributed optimization algorithms that are provably robust against
Byzantine failures—arbitrary and potentially adversarial behavior, in distributed computing …
Byzantine failures—arbitrary and potentially adversarial behavior, in distributed computing …
Dataset security for machine learning: Data poisoning, backdoor attacks, and defenses
As machine learning systems grow in scale, so do their training data requirements, forcing
practitioners to automate and outsource the curation of training data in order to achieve state …
practitioners to automate and outsource the curation of training data in order to achieve state …
Certified defenses for data poisoning attacks
Abstract Machine learning systems trained on user-provided data are susceptible to data
poisoning attacks, whereby malicious users inject false training data with the aim of …
poisoning attacks, whereby malicious users inject false training data with the aim of …