Non-convex optimization for machine learning

P Jain, P Kar - Foundations and Trends® in Machine …, 2017 - nowpublishers.com
A vast majority of machine learning algorithms train their models and perform inference by
solving optimization problems. In order to capture the learning and prediction problems …

Fast algorithms for robust PCA via gradient descent

X Yi, D Park, Y Chen… - Advances in neural …, 2016 - proceedings.neurips.cc
We consider the problem of Robust PCA in the fully and partially observed settings. Without
corruptions, this is the well-known matrix completion problem. From a statistical standpoint …

Selective inference for k-means clustering

YT Chen, DM Witten - Journal of Machine Learning Research, 2023 - jmlr.org
We consider the problem of testing for a difference in means between clusters of
observations identified via k-means clustering. In this setting, classical hypothesis tests lead …

Reducibility and statistical-computational gaps from secret leakage

M Brennan, G Bresler - Conference on Learning Theory, 2020 - proceedings.mlr.press
Inference problems with conjectured statistical-computational gaps are ubiquitous
throughout modern statistics, computer science, statistical physics and discrete probability …

Store: sparse tensor response regression and neuroimaging analysis

WW Sun, L Li - Journal of Machine Learning Research, 2017 - jmlr.org
Motivated by applications in neuroimaging analysis, we propose a new regression model,
Sparse TensOr REsponse regression (STORE), with a tensor response and a vector …

CHIME: Clustering of high-dimensional Gaussian mixtures with EM algorithm and its optimality

TT Cai, J Ma, L Zhang - 2019 - projecteuclid.org
Supplement to “CHIME: Clustering of high-dimensional Gaussian mixtures with EM
algorithm and its optimality”. This supplement provides detailed proofs of the Theorem 3.1 …

Global convergence of the EM algorithm for mixtures of two component linear regression

J Kwon, W Qian, C Caramanis… - … on Learning Theory, 2019 - proceedings.mlr.press
Abstract The Expectation-Maximization algorithm is perhaps the most broadly used
algorithm for inference of latent variable problems. A theoretical understanding of its …

A doubly enhanced em algorithm for model-based tensor clustering

Q Mai, X Zhang, Y Pan, K Deng - Journal of the American Statistical …, 2022 - Taylor & Francis
Modern scientific studies often collect datasets in the form of tensors. These datasets call for
innovative statistical analysis methods. In particular, there is a pressing need for tensor …

Simultaneous clustering and estimation of heterogeneous graphical models

B Hao, WW Sun, Y Liu, G Cheng - Journal of Machine Learning Research, 2018 - jmlr.org
We consider joint estimation of multiple graphical models arising from heterogeneous and
high-dimensional observations. Unlike most previous approaches which assume that the …

Singularity, misspecification and the convergence rate of EM

R Dwivedi, N Ho, K Khamaru, MJ Wainwright… - The Annals of …, 2020 - JSTOR
A line of recent work has analyzed the behavior of the Expectation-Maximization (EM)
algorithm in the well-specified setting, in which the population likelihood is locally strongly …