Survey of clustering algorithms

R Xu, D Wunsch - IEEE Transactions on neural networks, 2005 - ieeexplore.ieee.org
Data analysis plays an indispensable role for understanding various phenomena. Cluster
analysis, primitive exploration with little or no prior knowledge, consists of research …

Algorithms for hierarchical clustering: an overview

F Murtagh, P Contreras - Wiley Interdisciplinary Reviews: Data …, 2012 - Wiley Online Library
We survey agglomerative hierarchical clustering algorithms and discuss efficient
implementations that are available in R and other software environments. We look at …

[PDF][PDF] A k-means clustering algorithm

JA Hartigan, MA Wong - Applied statistics, 1979 - danida.vnu.edu.vn
METHOD The algorithm requires as input a matrix of M points in N dimensions and a matrix
of K initial cluster centres in N dimensions. The number of points in cluster L is denoted by …

A survey of clustering data mining techniques

P Berkhin - Grou** multidimensional data: Recent advances in …, 2006 - Springer
Clustering is the division of data into groups of similar objects. In clustering, some details are
disregarded in exchange for data simplification. Clustering can be viewed as a data …

A survey of clustering algorithms for big data: Taxonomy and empirical analysis

A Fahad, N Alshatri, Z Tari, A Alamri… - IEEE transactions on …, 2014 - ieeexplore.ieee.org
Clustering algorithms have emerged as an alternative powerful meta-learning tool to
accurately analyze the massive volume of data generated by modern applications. In …

[BOOK][B] Web data mining: exploring hyperlinks, contents, and usage data

B Liu - 2011 - Springer
Liu has written a comprehensive text on Web mining, which consists of two parts. The first
part covers the data mining and machine learning foundations, where all the essential …

[BOOK][B] Data clustering: theory, algorithms, and applications

G Gan, C Ma, J Wu - 2020 - SIAM
The monograph Data Clustering: Theory, Algorithms, and Applications was published in
2007. Starting with the common ground and knowledge for data clustering, the monograph …

K-means properties on six clustering benchmark datasets

P Fränti, S Sieranoja - Applied intelligence, 2018 - Springer
This paper has two contributions. First, we introduce a clustering basic benchmark. Second,
we study the performance of k-means using this benchmark. Specifically, we measure how …

Big data for cyber physical systems in industry 4.0: a survey

LD Xu, L Duan - Enterprise Information Systems, 2019 - Taylor & Francis
With the technology development in cyber physical systems and big data, there are huge
potential to apply them to achieve personalization and improve resource efficiency in …

Subspace clustering for high dimensional data: a review

L Parsons, E Haque, H Liu - Acm sigkdd explorations newsletter, 2004 - dl.acm.org
Subspace clustering is an extension of traditional clustering that seeks to find clusters in
different subspaces within a dataset. Often in high dimensional data, many dimensions are …