Cluster validity indices for automatic clustering: A comprehensive review
AM Ikotun, F Habyarimana, AE Ezugwu - Heliyon, 2025 - cell.com
Abstract The Cluster Validity Index is an integral part of clustering algorithms. It evaluates
inter-cluster separation and intra-cluster cohesion of candidate clusters to determine the …
inter-cluster separation and intra-cluster cohesion of candidate clusters to determine the …
Optimization of K-means clustering method using hybrid capuchin search algorithm
Abstract This work presents Hybrid Capuchin Search Algorithm (HCSA) as a meta-heuristic
method to deal with the vexing problems of local optima traps and initialization sensitivity of …
method to deal with the vexing problems of local optima traps and initialization sensitivity of …
Overcoming weaknesses of density peak clustering using a data-dependent similarity measure
Abstract Density Peak Clustering (DPC) is a popular state-of-the-art clustering algorithm,
which requires pairwise (dis) similarity of data objects to detect arbitrary shaped clusters …
which requires pairwise (dis) similarity of data objects to detect arbitrary shaped clusters …
A user-centric analysis of social media for stock market prediction
Social media platforms such as Twitter or StockTwits are widely used for sharing stock
market opinions between investors, traders, and entrepreneurs. Empirically, previous work …
market opinions between investors, traders, and entrepreneurs. Empirically, previous work …
A probabilistic topic model based on short distance co-occurrences
A limitation of many probabilistic topic models such as Latent Dirichlet Allocation (LDA) is
their inflexibility to use local contexts. As a result, these models cannot directly benefit from …
their inflexibility to use local contexts. As a result, these models cannot directly benefit from …
A mask-based output layer for multi-level hierarchical classification
This paper proposes a novel mask-based output layer for multi-level hierarchical
classification, addressing the limitations of existing methods which (i) often do not embed the …
classification, addressing the limitations of existing methods which (i) often do not embed the …
Marine-tree: A Large-scale Marine Organisms Dataset for Hierarchical Image Classification
This paper presents Marine-tree, a large-scale hierarchical annotated dataset for marine
organism classification. Marine-tree contains more than 160k annotated images divided into …
organism classification. Marine-tree contains more than 160k annotated images divided into …
A longitudinal study of topic classification on Twitter
Twitter represents a massively distributed information source over topics ranging from social
and political events to entertainment and sports news. While recent work has suggested this …
and political events to entertainment and sports news. While recent work has suggested this …
A Generalized Framework for Predictive Clustering and Optimization
Clustering is a powerful and extensively used data science tool. While clustering is generally
thought of as an unsupervised learning technique, there are also supervised variations such …
thought of as an unsupervised learning technique, there are also supervised variations such …
[Retracted] Research on Information Retrieval Effectiveness of University Scientific Researchers Based on Mental Model
Y Zhang, Yiyang, J Yang - Wireless Communications and …, 2022 - Wiley Online Library
The information retrieval behavior of scientific researchers is a behavior that is affected by
multiple factors such as cognition, emotion, task, and user type and has its unique cognitive …
multiple factors such as cognition, emotion, task, and user type and has its unique cognitive …