PPanGGOLiN: depicting microbial diversity via a partitioned pangenome graph

G Gautreau, A Bazin, M Gachet, R Planel… - PLoS computational …, 2020 - journals.plos.org
The use of comparative genomics for functional, evolutionary, and epidemiological studies
requires methods to classify gene families in terms of occurrence in a given species. These …

A Comprehensive Survey on Recent Feature Selection Methods for Mixed Data: Challenges, Solutions and Future Directions

ME Warkiani, MH Moattar - Neurocomputing, 2025 - Elsevier
Feature selection plays a crucial role in data analysis, pattern recognition, and machine
learning by reducing feature dimensions, improving execution time, and enhancing model …

Grid-based clustering using boundary detection

M Du, F Wu - Entropy, 2022 - mdpi.com
Clustering can be divided into five categories: partitioning, hierarchical, model-based,
density-based, and grid-based algorithms. Among them, grid-based clustering is highly …

Pharmacoprint: A combination of a pharmacophore fingerprint and artificial intelligence as a tool for computer-aided drug design

D Warszycki, Ł Struski, M Smieja, R Kafel… - Journal of chemical …, 2021 - ACS Publications
Structural fingerprints and pharmacophore modeling are methodologies that have been
used for at least 2 decades in various fields of cheminformatics, from similarity searching to …

[Retracted] An Ensemble Clustering Approach (Consensus Clustering) for High‐Dimensional Data

J Yan, W Liu - Security and Communication Networks, 2022 - Wiley Online Library
Due to the plurality of irrelevant attributes, sparse distribution, and complicated calculations
in high‐dimensional data, traditional clustering algorithms, such as K‐means, do not perform …

A phase angle-modulated bat algorithm with application to antenna topology optimization

J Dong, Z Wang, J Mo - Applied Sciences, 2021 - mdpi.com
This paper proposes a phase angle-modulated bat algorithm (P-AMBA) for high-
dimensional binary optimization. The idea was to reduce the optimization time by …

Information architecture: using best merge method, category validity, and multidimensional scaling for open card sort data analysis

S Paea, C Katsanos, G Bulivou - International Journal of Human …, 2024 - Taylor & Francis
Open card sorting is a widely used method in HCI for the design of user-centered
Information Architectures (IAs). This article proposes a new algorithm that combines the best …

Latent skill mining and labeling from courseware content

N Matsuda, J Wood, R Shrivastava… - Journal of educational …, 2022 - par.nsf.gov
A model that maps the requisite skills, or knowledge components, to the contents of an
online course is necessary to implement many adaptive learning technologies. However …

Interactive information bottleneck for high-dimensional co-occurrence data clustering

S Hu, R Wang, Y Ye - Applied Soft Computing, 2021 - Elsevier
Clustering high-dimensional data is quite challenging due to lots of redundant and irrelevant
information contained in features. Most existing methods sequentially or jointly perform the …

Multi source data association clustering analysis based on symmetric encryption algorithm

H Wang - Mobile Networks and Applications, 2022 - Springer
Due to the low clustering accuracy of the existing methods, a multi-source data association
clustering method based on symmetric encryption algorithm is proposed. The multi-source …