K-means clustering algorithms: A comprehensive review, variants analysis, and advances in the era of big data

AM Ikotun, AE Ezugwu, L Abualigah, B Abuhaija… - Information …, 2023 - Elsevier
Advances in recent techniques for scientific data collection in the era of big data allow for the
systematic accumulation of large quantities of data at various data-capturing sites. Similarly …

A review and evaluation of elastic distance functions for time series clustering

C Holder, M Middlehurst, A Bagnall - Knowledge and Information Systems, 2024 - Springer
Time series clustering is the act of grou** time series data without recourse to a label.
Algorithms that cluster time series can be classified into two groups: those that employ a time …

[HTML][HTML] Fast and eager k-medoids clustering: O (k) runtime improvement of the PAM, CLARA, and CLARANS algorithms

E Schubert, PJ Rousseeuw - Information Systems, 2021 - Elsevier
Clustering non-Euclidean data is difficult, and one of the most used algorithms besides
hierarchical clustering is the popular algorithm Partitioning Around Medoids (PAM), also …

A fast adaptive k-means with no bounds

S **a, D Peng, D Meng, C Zhang, G Wang… - IEEE Transactions on …, 2020 - par.nsf.gov
This paper presents a novel accelerated exact k-means called as" Ball k-means" by using
the ball to describe each cluster, which focus on reducing the point-centroid distance …

Ball -Means: Fast Adaptive Clustering With No Bounds

S **a, D Peng, D Meng, C Zhang… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
This paper presents a novel accelerated exact-means called as “Ball-means” by using the
ball to describe each cluster, which focus on reducing the point-centroid distance …

{ModelKeeper}: Accelerating {DNN} training via automated training warmup

F Lai, Y Dai, HV Madhyastha… - 20th USENIX Symposium …, 2023 - usenix.org
With growing deployment of machine learning (ML) models, ML developers are training or re-
training increasingly more deep neural networks (DNNs). They do so to find the most …

K-means clustering with natural density peaks for discovering arbitrary-shaped clusters

D Cheng, J Huang, S Zhang, S **a… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Due to simplicity, K-means has become a widely used clustering method. However, its
clustering result is seriously affected by the initial centers and the allocation strategy makes …

Efficient Multi-View K-Means for Image Clustering

H Lu, H Xu, Q Wang, Q Gao, M Yang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Nowadays, data in the real world often comes from multiple sources, but most existing multi-
view-Means perform poorly on linearly non-separable data and require initializing the …

K-means-G*: Accelerating k-means clustering algorithm utilizing primitive geometric concepts

H Ismkhan, M Izadi - Information Sciences, 2022 - Elsevier
The k-means is the most popular clustering algorithm, but, as it needs too many distance
computations, its speed is dramatically fall down against high-dimensional data. Although …

Banditpam: Almost linear time k-medoids clustering via multi-armed bandits

M Tiwari, MJ Zhang, J Mayclin… - Advances in …, 2020 - proceedings.neurips.cc
Clustering is a ubiquitous task in data science. Compared to the commonly used k-means
clustering, k-medoids clustering requires the cluster centers to be actual data points and …