K-means-G*: Accelerating k-means clustering algorithm utilizing primitive geometric concepts
The k-means is the most popular clustering algorithm, but, as it needs too many distance
computations, its speed is dramatically fall down against high-dimensional data. Although …
computations, its speed is dramatically fall down against high-dimensional data. Although …
DBGSA: A novel data adaptive bregman clustering algorithm
Y **ao, H Li, Y Zhang - Engineering Applications of Artificial Intelligence, 2024 - Elsevier
Traditional clustering algorithms such as K-means are highly sensitive to the initial centroid
selection and perform poorly on non-convex dataset. To address these problems, a novel …
selection and perform poorly on non-convex dataset. To address these problems, a novel …
High-order manifold regularized multi-view subspace clustering with robust affinity matrices and weighted TNN
B Cai, GF Lu, L Yao, H Li - Pattern Recognition, 2023 - Elsevier
Multi-view subspace clustering achieves impressive performance for high-dimensional data.
However, many of these models do not sufficiently mine the intrinsic information among …
However, many of these models do not sufficiently mine the intrinsic information among …
Clustering above exponential families with tempered exponential measures
The link with exponential families has allowed k-means clustering to be generalized to a
wide variety of data-generating distributions in exponential families and clustering …
wide variety of data-generating distributions in exponential families and clustering …
Bregman power k-means for clustering exponential family data
Recent progress in center-based clustering algorithms combats poor local minima by implicit
annealing through a family of generalized means. These methods are variations of Lloyd's …
annealing through a family of generalized means. These methods are variations of Lloyd's …
Robust principal component analysis: a median of means approach
Principal component analysis (PCA) is a fundamental tool for data visualization, denoising,
and dimensionality reduction. It is widely popular in statistics, machine learning, computer …
and dimensionality reduction. It is widely popular in statistics, machine learning, computer …
Gradient based clustering
We propose a general approach for distance based clustering, using the gradient of the cost
function that measures clustering quality with respect to cluster assignments and cluster …
function that measures clustering quality with respect to cluster assignments and cluster …
Convergence of online k-means
We prove asymptotic convergence for a general class of k-means algorithms performed over
streaming data from a distribution–the centers asymptotically converge to the set of …
streaming data from a distribution–the centers asymptotically converge to the set of …
Distributed Center-based Clustering: A Unified Framework
We develop a family of distributed center-based clustering algorithms that work over
connected networks of users. In the proposed scenario, users contain a local dataset and …
connected networks of users. In the proposed scenario, users contain a local dataset and …
Clustering High-dimensional Data with Ordered Weighted Regularization
Clustering complex high-dimensional data is particularly challenging as the signal-to-noise
ratio in such data is significantly lower than their classical counterparts. This is mainly …
ratio in such data is significantly lower than their classical counterparts. This is mainly …