A review of clustering techniques and developments
This paper presents a comprehensive study on clustering: exiting methods and
developments made at various times. Clustering is defined as an unsupervised learning …
developments made at various times. Clustering is defined as an unsupervised learning …
Nonconvex optimization meets low-rank matrix factorization: An overview
Substantial progress has been made recently on develo** provably accurate and efficient
algorithms for low-rank matrix factorization via nonconvex optimization. While conventional …
algorithms for low-rank matrix factorization via nonconvex optimization. While conventional …
Spectral methods for data science: A statistical perspective
Spectral methods have emerged as a simple yet surprisingly effective approach for
extracting information from massive, noisy and incomplete data. In a nutshell, spectral …
extracting information from massive, noisy and incomplete data. In a nutshell, spectral …
[HTML][HTML] Entrywise eigenvector analysis of random matrices with low expected rank
Recovering low-rank structures via eigenvector perturbation analysis is a common problem
in statistical machine learning, such as in factor analysis, community detection, ranking …
in statistical machine learning, such as in factor analysis, community detection, ranking …
A review of stochastic block models and extensions for graph clustering
There have been rapid developments in model-based clustering of graphs, also known as
block modelling, over the last ten years or so. We review different approaches and …
block modelling, over the last ten years or so. We review different approaches and …
A useful variant of the Davis–Kahan theorem for statisticians
Abstract The Davis–Kahan theorem is used in the analysis of many statistical procedures to
bound the distance between subspaces spanned by population eigenvectors and their …
bound the distance between subspaces spanned by population eigenvectors and their …
Exact recovery in the stochastic block model
The stochastic block model with two communities, or equivalently the planted bisection
model, is a popular model of random graph exhibiting a cluster behavior. In the symmetric …
model, is a popular model of random graph exhibiting a cluster behavior. In the symmetric …
Consistency of spectral clustering in stochastic block models
We analyze the performance of spectral clustering for community extraction in stochastic
block models. We show that, under mild conditions, spectral clustering applied to the …
block models. We show that, under mild conditions, spectral clustering applied to the …
Community detection in general stochastic block models: Fundamental limits and efficient algorithms for recovery
E Abbe, C Sandon - 2015 IEEE 56th Annual Symposium on …, 2015 - ieeexplore.ieee.org
New phase transition phenomena have recently been discovered for the stochastic block
model, for the special case of two non-overlap** symmetric communities. This gives raise …
model, for the special case of two non-overlap** symmetric communities. This gives raise …
Statistical inference on random dot product graphs: a survey
The random dot product graph (RDPG) is an independent-edge random graph that is
analytically tractable and, simultaneously, either encompasses or can successfully …
analytically tractable and, simultaneously, either encompasses or can successfully …