Cluster validity indices for automatic clustering: A comprehensive review

AM Ikotun, F Habyarimana, AE Ezugwu - Heliyon, 2025 - cell.com
The Cluster Validity Index is an integral part of clustering algorithms. It evaluates
inter-cluster separation and intra-cluster cohesion of candidate clusters to determine the …

Optimization of K-means clustering method using hybrid capuchin search algorithm

A Qtaish, M Braik, D Albashish, MT Alshammari… - The Journal of …, 2024 - Springer
This work presents Hybrid Capuchin Search Algorithm (HCSA) as a meta-heuristic
method to deal with the vexing problems of local optima traps and initialization sensitivity of …

Overcoming weaknesses of density peak clustering using a data-dependent similarity measure

Z Rasool, S Aryal, MR Bouadjenek, R Dazeley - Pattern Recognition, 2023 - Elsevier
Density Peak Clustering (DPC) is a popular state-of-the-art clustering algorithm,
which requires pairwise (dis)similarity of data objects to detect arbitrary shaped clusters …

A user-centric analysis of social media for stock market prediction

MR Bouadjenek, S Sanner, G Wu - ACM Transactions on the Web, 2023 - dl.acm.org
Social media platforms such as Twitter or StockTwits are widely used for sharing stock
market opinions between investors, traders, and entrepreneurs. Empirically, previous work …

A probabilistic topic model based on short distance co-occurrences

M Rahimi, M Zahedi, H Mashayekhi - Expert Systems with Applications, 2022 - Elsevier
A limitation of many probabilistic topic models such as Latent Dirichlet Allocation (LDA) is
their inflexibility to use local contexts. As a result, these models cannot directly benefit from …

A mask-based output layer for multi-level hierarchical classification

T Boone-Sifuentes, MR Bouadjenek, I Razzak… - Proceedings of the 31st …, 2022 - dl.acm.org
This paper proposes a novel mask-based output layer for multi-level hierarchical
classification, addressing the limitations of existing methods which (i) often do not embed the …

Marine-tree: A Large-scale Marine Organisms Dataset for Hierarchical Image Classification

T Boone-Sifuentes, A Nazari, I Razzak… - Proceedings of the 31st …, 2022 - dl.acm.org
This paper presents Marine-tree, a large-scale hierarchical annotated dataset for marine
organism classification. Marine-tree contains more than 160k annotated images divided into …

A longitudinal study of topic classification on Twitter

MR Bouadjenek, S Sanner, Z Iman, L Xie… - PeerJ Computer …, 2022 - peerj.com
Twitter represents a massively distributed information source over topics ranging from social
and political events to entertainment and sports news. While recent work has suggested this …

A Generalized Framework for Predictive Clustering and Optimization

A Chembu, S Sanner - arXiv preprint arXiv:2305.04364, 2023 - arxiv.org
Clustering is a powerful and extensively used data science tool. While clustering is generally
thought of as an unsupervised learning technique, there are also supervised variations such …

[Retracted] Research on Information Retrieval Effectiveness of University Scientific Researchers Based on Mental Model

Y Zhang, Yiyang, J Yang - Wireless Communications and …, 2022 - Wiley Online Library
The information retrieval behavior of scientific researchers is a behavior that is affected by
multiple factors such as cognition, emotion, task, and user type and has its unique cognitive …