Recent advances in algorithmic high-dimensional robust statistics

I Diakonikolas, DM Kane - arxiv preprint arxiv:1911.05911, 2019 - arxiv.org
Learning in the presence of outliers is a fundamental problem in statistics. Until recently, all
known efficient unsupervised learning algorithms were very sensitive to outliers in high …

Learning mixtures of gaussians using the DDPM objective

K Shah, S Chen, A Klivans - Advances in Neural …, 2023 - proceedings.neurips.cc
Recent works have shown that diffusion models can learn essentially any distribution
provided one can perform score estimation. Yet it remains poorly understood under what …

Robust estimators in high-dimensions without the computational intractability

I Diakonikolas, G Kamath, D Kane, J Li, A Moitra… - SIAM Journal on …, 2019 - SIAM
We study high-dimensional distribution learning in an agnostic setting where an adversary is
allowed to arbitrarily corrupt an ε-fraction of the samples. Such questions have a rich history …

Sever: A robust meta-algorithm for stochastic optimization

I Diakonikolas, G Kamath, D Kane, J Li… - International …, 2019 - proceedings.mlr.press
In high dimensions, most machine learning methods are brittle to even a small fraction of
structured outliers. To address this, we introduce a new meta-algorithm that can take in a …

Estimation contracts for outlier-robust geometric perception

L Carlone - Foundations and Trends® in Robotics, 2023 - nowpublishers.com
Outlier-robust estimation is a fundamental problem and has been extensively investigated
by statisticians and practitioners. The last few years have seen a convergence across …

Private distribution learning with public data: The view from sample compression

S Ben-David, A Bie, CL Canonne… - Advances in …, 2023 - proceedings.neurips.cc
We study the problem of private distribution learning with access to public data. In this setup,
which we refer to as* public-private learning*, the learner is given public and private …

Robust and differentially private mean estimation

X Liu, W Kong, S Kakade, S Oh - Advances in neural …, 2021 - proceedings.neurips.cc
In statistical learning and analysis from shared data, which is increasingly widely adopted in
platforms such as federated learning and meta-learning, there are two major concerns …

An analysis of the t-sne algorithm for data visualization

S Arora, W Hu, PK Kothari - Conference on learning theory, 2018 - proceedings.mlr.press
A first line of attack in exploratory data analysis is\emph {data visualization}, ie, generating a
2-dimensional representation of data that makes\emph {clusters} of similar points visually …

The curse of concentration in robust learning: Evasion and poisoning attacks from concentration of measure

S Mahloujifar, DI Diochnos, M Mahmoody - Proceedings of the AAAI …, 2019 - ojs.aaai.org
Many modern machine learning classifiers are shown to be vulnerable to adversarial
perturbations of the instances. Despite a massive amount of work focusing on making …

Efficient algorithms and lower bounds for robust linear regression

I Diakonikolas, W Kong, A Stewart - Proceedings of the Thirtieth Annual ACM …, 2019 - SIAM
We study the prototypical problem of high-dimensional linear regression in a robust model
where an ε-fraction of the samples can be adversarially corrupted. We focus on the …