A comprehensive survey of machine learning methodologies with emphasis in water resources management

M Drogkoula, K Kokkinos, N Samaras - Applied Sciences, 2023 - mdpi.com
This paper offers a comprehensive overview of machine learning (ML) methodologies and
algorithms, highlighting their practical applications in the critical domain of water resource …

Short text clustering algorithms, application and challenges: A survey

MH Ahmed, S Tiun, N Omar, NS Sani - Applied Sciences, 2022 - mdpi.com
The number of online documents has rapidly grown, and with the expansion of the Web,
document analysis, or text analysis, has become an essential task for preparing, storing …

Supporting clustering with contrastive learning

D Zhang, F Nan, X Wei, S Li, H Zhu, K McKeown… - arxiv preprint arxiv …, 2021 - arxiv.org
Unsupervised clustering aims at discovering the semantic categories of data according to
some distance measured in the representation space. However, different categories often …

Autoencoders and their applications in machine learning: a survey

K Berahmand, F Daneshfar, ES Salehi, Y Li… - Artificial Intelligence …, 2024 - Springer
Autoencoders have become a hot researched topic in unsupervised learning due to their
ability to learn data features and act as a dimensionality reduction method. With rapid …

Evaluation of clustering and topic modeling methods over health-related tweets and emails

JA Lossio-Ventura, S Gonzales, J Morzan… - Artificial intelligence in …, 2021 - Elsevier
Background Internet provides different tools for communicating with patients, such as social
media (eg, Twitter) and email platforms. These platforms provided new data sources to shed …

Clusterllm: Large language models as a guide for text clustering

Y Zhang, Z Wang, J Shang - arxiv preprint arxiv:2305.14871, 2023 - arxiv.org
We introduce ClusterLLM, a novel text clustering framework that leverages feedback from an
instruction-tuned large language model, such as ChatGPT. Compared with traditional …

Synergizing large language models and pre-trained smaller models for conversational intent discovery

J Liang, L Liao, H Fei, J Jiang - Findings of the Association for …, 2024 - aclanthology.org
Abstract In Conversational Intent Discovery (CID), Small Language Models (SLMs) struggle
with overfitting to familiar intents and fail to label newly discovered ones. This issue stems …

Joint unsupervised contrastive learning and robust GMM for text clustering

C Hu, T Wu, S Liu, C Liu, T Ma, F Yang - Information Processing & …, 2024 - Elsevier
Text clustering aims to organize a vast collection of documents into meaningful and coherent
clusters, thereby facilitating the extraction of valuable insights. While current frameworks for …

Enhancement of short text clustering by iterative classification

MRH Rakib, N Zeh, M Jankowska, E Milios - Natural Language Processing …, 2020 - Springer
Short text clustering is a challenging task due to the lack of signal contained in short texts. In
this work, we propose iterative classification as a method to boost the clustering quality of …

An online semantic-enhanced Dirichlet model for short text stream clustering

J Kumar, J Shao, S Uddin, W Ali - … of the 58th annual meeting of …, 2020 - aclanthology.org
Clustering short text streams is a challenging task due to its unique properties: infinite
length, sparse data representation and cluster evolution. Existing approaches often exploit …