Byzantine machine learning: A primer

R Guerraoui, N Gupta, R Pinot - ACM Computing Surveys, 2024 - dl.acm.org
The problem of Byzantine resilience in distributed machine learning, aka Byzantine machine
learning, consists of designing distributed algorithms that can train an accurate model …

Recent advances in algorithmic high-dimensional robust statistics

I Diakonikolas, DM Kane - arxiv preprint arxiv:1911.05911, 2019 - arxiv.org
Learning in the presence of outliers is a fundamental problem in statistics. Until recently, all
known efficient unsupervised learning algorithms were very sensitive to outliers in high …

Data collection and quality challenges in deep learning: A data-centric ai perspective

SE Whang, Y Roh, H Song, JG Lee - The VLDB Journal, 2023 - Springer
Data-centric AI is at the center of a fundamental shift in software engineering where machine
learning becomes the new software, powered by big data and computing infrastructure …

Teaser: Fast and certifiable point cloud registration

H Yang, J Shi, L Carlone - IEEE Transactions on Robotics, 2020 - ieeexplore.ieee.org
We propose the first fast and certifiable algorithm for the registration of two sets of three-
dimensional (3-D) points in the presence of large amounts of outlier correspondences. A …

Remember what you want to forget: Algorithms for machine unlearning

A Sekhari, J Acharya, G Kamath… - Advances in Neural …, 2021 - proceedings.neurips.cc
We study the problem of unlearning datapoints from a learnt model. The learner first
receives a dataset $ S $ drawn iid from an unknown distribution, and outputs a model …

Spectral signatures in backdoor attacks

B Tran, J Li, A Madry - Advances in neural information …, 2018 - proceedings.neurips.cc
A recent line of work has uncovered a new form of data poisoning: so-called backdoor
attacks. These attacks are particularly dangerous because they do not affect a network's …

Robust aggregation for federated learning

K Pillutla, SM Kakade… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
We present a novel approach to federated learning that endows its aggregation process with
greater robustness to potential poisoning of local data or model parameters of participating …

Byzantine-robust distributed learning: Towards optimal statistical rates

D Yin, Y Chen, R Kannan… - … conference on machine …, 2018 - proceedings.mlr.press
In this paper, we develop distributed optimization algorithms that are provably robust against
Byzantine failures—arbitrary and potentially adversarial behavior, in distributed computing …

Dataset security for machine learning: Data poisoning, backdoor attacks, and defenses

M Goldblum, D Tsipras, C **e, X Chen… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
As machine learning systems grow in scale, so do their training data requirements, forcing
practitioners to automate and outsource the curation of training data in order to achieve state …

Certified defenses for data poisoning attacks

J Steinhardt, PWW Koh… - Advances in neural …, 2017 - proceedings.neurips.cc
Abstract Machine learning systems trained on user-provided data are susceptible to data
poisoning attacks, whereby malicious users inject false training data with the aim of …