A selective overview of sparse principal component analysis

H Zou, L Xue - Proceedings of the IEEE, 2018 - ieeexplore.ieee.org
Principal component analysis (PCA) is a widely used technique for dimension reduction,
data processing, and feature extraction. The three tasks are particularly useful and important …

[КНИГА][B] Statistical foundations of data science

J Fan, R Li, CH Zhang, H Zou - 2020 - taylorfrancis.com
Statistical Foundations of Data Science gives a thorough introduction to commonly used
statistical models, contemporary statistical machine learning techniques and algorithms …

Fast low-rank estimation by projected gradient descent: General statistical and algorithmic guarantees

Y Chen, MJ Wainwright - arxiv preprint arxiv:1509.03025, 2015 - arxiv.org
Optimization problems with rank constraints arise in many applications, including matrix
regression, structured PCA, matrix completion and matrix decomposition problems. An …

Statistical query lower bounds for robust estimation of high-dimensional gaussians and gaussian mixtures

I Diakonikolas, DM Kane… - 2017 IEEE 58th Annual …, 2017 - ieeexplore.ieee.org
We describe a general technique that yields the first Statistical Query lower bounds for a
range of fundamental high-dimensional learning problems involving Gaussian distributions …

Hutch++: Optimal stochastic trace estimation

RA Meyer, C Musco, C Musco, DP Woodruff - Symposium on Simplicity in …, 2021 - SIAM
We study the problem of estimating the trace of a matrix A that can only be accessed through
matrix-vector multiplication. We introduce a new randomized algorithm, Hutch++, which …

Rate-optimal perturbation bounds for singular subspaces with applications to high-dimensional statistics

TT Cai, A Zhang - 2018 - projecteuclid.org
Supplement to “Rate-optimal perturbation bounds for singular subspaces with applications
to high-dimensional statistics”. The supplementary material includes the proofs for Theorem …

Asymptotics of empirical eigenstructure for high dimensional spiked covariance

W Wang, J Fan - Annals of statistics, 2017 - pmc.ncbi.nlm.nih.gov
We derive the asymptotic distributions of the spiked eigenvalues and eigenvectors under a
generalized and unified asymptotic regime, which takes into account the magnitude of …

Sparse PCA: Optimal rates and adaptive estimation

TT Cai, Z Ma, Y Wu - 2013 - projecteuclid.org
Sparse PCA: Optimal rates and adaptive estimation Page 1 The Annals of Statistics 2013, Vol.
41, No. 6, 3074–3110 DOI: 10.1214/13-AOS1178 © Institute of Mathematical Statistics, 2013 …

PCA in high dimensions: An orientation

IM Johnstone, D Paul - Proceedings of the IEEE, 2018 - ieeexplore.ieee.org
When the data are high dimensional, widely used multivariate statistical methods such as
principal component analysis can behave in unexpected ways. In settings where the …