A systematic literature review on identifying patterns using unsupervised clustering algorithms: A data mining perspective

M Chaudhry, I Shafi, M Mahnoor, DLR Vargas… - Symmetry, 2023 - mdpi.com
Data mining is an analytical approach that contributes to achieving a solution to many
problems by extracting previously unknown, fascinating, nontrivial, and potentially valuable …

Fast hierarchical Bayesian analysis of population structure

G Tonkin-Hill, JA Lees, SD Bentley… - Nucleic acids …, 2019 - academic.oup.com
We present fastbaps, a fast solution to the genetic clustering problem. Fastbaps rapidly
identifies an approximate fit to a Dirichlet process mixture model (DPM) for clustering …

Consumers' affective needs matter: Open innovation through mining luxury hotels' online reviews

J Wu, T Yang, Z Zhou, N Zhao - International Journal of Hospitality …, 2023 - Elsevier
Extant research on hotel open innovation rarely explores innovative ideas from customer-
generated online reviews and pays little attention to consumers' affective needs. To bridge …

Hierarchical clustering supported by reciprocal nearest neighbors

WB **e, YL Lee, C Wang, DB Chen, T Zhou - Information Sciences, 2020 - Elsevier
Clustering is a fundamental tool aiming at classifying data points into groups based on their
pairwise distances or similarities. It has found successful applications in all natural and …

Comparing the qualitative performances of handheld NIR and Raman spectrophotometers for the detection of falsified pharmaceutical products

PH Ciza, PY Sacre, C Waffo, L Coïc, H Avohou… - Talanta, 2019 - Elsevier
Over the last decade, the growth of the global pharmaceutical market has led to an overall
increase of substandard and falsified drugs especially on the African market (or emerging …

Scalable clustering by aggregating representatives in hierarchical groups

WB **e, Z Liu, D Das, B Chen, J Srivastava - Pattern Recognition, 2023 - Elsevier
Appropriately handling the scalability of clustering is a long-standing challenge for the study
of clustering techniques and is of fundamental interest to researchers in the community of …

[HTML][HTML] Fundamental clustering algorithms suite

MC Thrun, Q Stier - SoftwareX, 2021 - Elsevier
The article presents immediate access to over fifty fundamental clustering algorithms.
Additionally, access to clustering benchmark datasets published priorly as “Fundamental …

Are cluster validity measures (in) valid?

M Gagolewski, M Bartoszuk, A Cena - Information Sciences, 2021 - Elsevier
Internal cluster validity measures (such as the Calinski–Harabasz, Dunn, or Davies–Bouldin
indices) are frequently used for selecting the appropriate number of partitions a dataset …

WEClustering: word embeddings based text clustering technique for large datasets

V Mehta, S Bawa, J Singh - Complex & intelligent systems, 2021 - Springer
A massive amount of textual data now exists in digital repositories in the form of research
articles, news articles, reviews, Wikipedia articles, and books, etc. Text clustering is a …

Inferring population structure in biobank-scale genomic data

AM Chiu, EK Molloy, Z Tan, A Talwalkar… - The American Journal of …, 2022 - cell.com
Inferring the structure of human populations from genetic variation data is a key task in
population and medical genomic studies. Although a number of methods for population …