A systematic literature review on identifying patterns using unsupervised clustering algorithms: A data mining perspective
Data mining is an analytical approach that contributes to achieving a solution to many
problems by extracting previously unknown, fascinating, nontrivial, and potentially valuable …
problems by extracting previously unknown, fascinating, nontrivial, and potentially valuable …
Fast hierarchical Bayesian analysis of population structure
We present fastbaps, a fast solution to the genetic clustering problem. Fastbaps rapidly
identifies an approximate fit to a Dirichlet process mixture model (DPM) for clustering …
identifies an approximate fit to a Dirichlet process mixture model (DPM) for clustering …
Consumers' affective needs matter: Open innovation through mining luxury hotels' online reviews
Extant research on hotel open innovation rarely explores innovative ideas from customer-
generated online reviews and pays little attention to consumers' affective needs. To bridge …
generated online reviews and pays little attention to consumers' affective needs. To bridge …
Hierarchical clustering supported by reciprocal nearest neighbors
Clustering is a fundamental tool aiming at classifying data points into groups based on their
pairwise distances or similarities. It has found successful applications in all natural and …
pairwise distances or similarities. It has found successful applications in all natural and …
Comparing the qualitative performances of handheld NIR and Raman spectrophotometers for the detection of falsified pharmaceutical products
Over the last decade, the growth of the global pharmaceutical market has led to an overall
increase of substandard and falsified drugs especially on the African market (or emerging …
increase of substandard and falsified drugs especially on the African market (or emerging …
Scalable clustering by aggregating representatives in hierarchical groups
Appropriately handling the scalability of clustering is a long-standing challenge for the study
of clustering techniques and is of fundamental interest to researchers in the community of …
of clustering techniques and is of fundamental interest to researchers in the community of …
[HTML][HTML] Fundamental clustering algorithms suite
The article presents immediate access to over fifty fundamental clustering algorithms.
Additionally, access to clustering benchmark datasets published priorly as “Fundamental …
Additionally, access to clustering benchmark datasets published priorly as “Fundamental …
Are cluster validity measures (in) valid?
Internal cluster validity measures (such as the Calinski–Harabasz, Dunn, or Davies–Bouldin
indices) are frequently used for selecting the appropriate number of partitions a dataset …
indices) are frequently used for selecting the appropriate number of partitions a dataset …
WEClustering: word embeddings based text clustering technique for large datasets
A massive amount of textual data now exists in digital repositories in the form of research
articles, news articles, reviews, Wikipedia articles, and books, etc. Text clustering is a …
articles, news articles, reviews, Wikipedia articles, and books, etc. Text clustering is a …
Inferring population structure in biobank-scale genomic data
Inferring the structure of human populations from genetic variation data is a key task in
population and medical genomic studies. Although a number of methods for population …
population and medical genomic studies. Although a number of methods for population …