K-means clustering algorithms: A comprehensive review, variants analysis, and advances in the era of big data

AM Ikotun, AE Ezugwu, L Abualigah, B Abuhaija… - Information …, 2023 - Elsevier
Advances in recent techniques for scientific data collection in the era of big data allow for the
systematic accumulation of large quantities of data at various data-capturing sites. Similarly …

A taxonomy of machine learning clustering algorithms, challenges, and future realms

S Pitafi, T Anwar, Z Sharif - Applied sciences, 2023 - mdpi.com
In the field of data mining, clustering has shown to be an important technique. Numerous
clustering methods have been devised and put into practice, and most of them locate high …

Internet of Things-driven data mining for smart crop production prediction in the peasant farming domain

LO Colombo-Mendoza, MA Paredes-Valverde… - Applied Sciences, 2022 - mdpi.com
Internet of Things (IoT) technologies can greatly benefit from machine-learning techniques
and artificial neural networks for data mining and vice versa. In the agricultural field, this …

Mental fatigue detection using a wearable commodity device and machine learning

C Goumopoulos, N Potha - Journal of Ambient Intelligence and …, 2023 - Springer
Mental fatigue is a psychophysiological state that has an intense adverse effect on the
quality of life, undermining both the mental and the physical health. As a consequence …

Determining the quality of a dataset in clustering terms

A Rachwał, E Popławska, I Gorgol, T Cieplak… - Applied Sciences, 2023 - mdpi.com
The purpose of the theoretical considerations and research conducted was to indicate the
instruments with which the quality of a dataset can be verified for the segmentation of …

Prediction of diabetes complications using computational intelligence techniques

T Alghamdi - Applied Sciences, 2023 - mdpi.com
Diabetes is a complex disease that can lead to serious health complications if left
unmanaged. Early detection and treatment of diabetes is crucial, and data analysis and …

A survey of machine learning and meta-heuristics approaches for sensor-based human activity recognition systems

A Saha, S Rajak, J Saha, C Chowdhury - Journal of Ambient Intelligence …, 2024 - Springer
Abstract Human Activity Recognition (HAR) is an important research area that has profound
applications in healthcare, security and surveillance. Starting from traditional machine …

Are cluster validity measures (in) valid?

M Gagolewski, M Bartoszuk, A Cena - Information Sciences, 2021 - Elsevier
Internal cluster validity measures (such as the Calinski–Harabasz, Dunn, or Davies–Bouldin
indices) are frequently used for selecting the appropriate number of partitions a dataset …

A comprehensive review of clustering techniques in artificial intelligence for knowledge discovery: Taxonomy, challenges, applications and future prospects

J Singh, D Singh - Advanced Engineering Informatics, 2024 - Elsevier
Clustering is a set of essential mathematical techniques in artificial intelligence and machine
learning for analyzing massive amounts of data generated by applications. Clustering uses …

A generalized multi-aspect distance metric for mixed-type data clustering

E Mousavi, M Sehhati - Pattern Recognition, 2023 - Elsevier
Distance calculation is straightforward when working with pure categorical or pure numerical
data sets. Defining a unified distance to improve the clustering performance for a mixed data …