A review of clustering techniques and developments

A Saxena, M Prasad, A Gupta, N Bharill, OP Patel… - Neurocomputing, 2017 - Elsevier
This paper presents a comprehensive study on clustering: exiting methods and
developments made at various times. Clustering is defined as an unsupervised learning …

Nonconvex optimization meets low-rank matrix factorization: An overview

Y Chi, YM Lu, Y Chen - IEEE Transactions on Signal …, 2019 - ieeexplore.ieee.org
Substantial progress has been made recently on develo** provably accurate and efficient
algorithms for low-rank matrix factorization via nonconvex optimization. While conventional …

Spectral methods for data science: A statistical perspective

Y Chen, Y Chi, J Fan, C Ma - Foundations and Trends® in …, 2021 - nowpublishers.com
Spectral methods have emerged as a simple yet surprisingly effective approach for
extracting information from massive, noisy and incomplete data. In a nutshell, spectral …

[HTML][HTML] Entrywise eigenvector analysis of random matrices with low expected rank

E Abbe, J Fan, K Wang, Y Zhong - Annals of statistics, 2020 - ncbi.nlm.nih.gov
Recovering low-rank structures via eigenvector perturbation analysis is a common problem
in statistical machine learning, such as in factor analysis, community detection, ranking …

A review of stochastic block models and extensions for graph clustering

C Lee, DJ Wilkinson - Applied Network Science, 2019 - Springer
There have been rapid developments in model-based clustering of graphs, also known as
block modelling, over the last ten years or so. We review different approaches and …

A useful variant of the Davis–Kahan theorem for statisticians

Y Yu, T Wang, RJ Samworth - Biometrika, 2015 - academic.oup.com
Abstract The Davis–Kahan theorem is used in the analysis of many statistical procedures to
bound the distance between subspaces spanned by population eigenvectors and their …

Exact recovery in the stochastic block model

E Abbe, AS Bandeira, G Hall - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
The stochastic block model with two communities, or equivalently the planted bisection
model, is a popular model of random graph exhibiting a cluster behavior. In the symmetric …

Consistency of spectral clustering in stochastic block models

J Lei, A Rinaldo - The Annals of Statistics, 2015 - JSTOR
We analyze the performance of spectral clustering for community extraction in stochastic
block models. We show that, under mild conditions, spectral clustering applied to the …

Community detection in general stochastic block models: Fundamental limits and efficient algorithms for recovery

E Abbe, C Sandon - 2015 IEEE 56th Annual Symposium on …, 2015 - ieeexplore.ieee.org
New phase transition phenomena have recently been discovered for the stochastic block
model, for the special case of two non-overlap** symmetric communities. This gives raise …

Statistical inference on random dot product graphs: a survey

A Athreya, DE Fishkind, M Tang, CE Priebe… - Journal of Machine …, 2018 - jmlr.org
The random dot product graph (RDPG) is an independent-edge random graph that is
analytically tractable and, simultaneously, either encompasses or can successfully …