A comprehensive survey of machine learning methodologies with emphasis in water resources management
This paper offers a comprehensive overview of machine learning (ML) methodologies and
algorithms, highlighting their practical applications in the critical domain of water resource …
algorithms, highlighting their practical applications in the critical domain of water resource …
Short text clustering algorithms, application and challenges: A survey
The number of online documents has rapidly grown, and with the expansion of the Web,
document analysis, or text analysis, has become an essential task for preparing, storing …
document analysis, or text analysis, has become an essential task for preparing, storing …
Supporting clustering with contrastive learning
Unsupervised clustering aims at discovering the semantic categories of data according to
some distance measured in the representation space. However, different categories often …
some distance measured in the representation space. However, different categories often …
Autoencoders and their applications in machine learning: a survey
Autoencoders have become a hot researched topic in unsupervised learning due to their
ability to learn data features and act as a dimensionality reduction method. With rapid …
ability to learn data features and act as a dimensionality reduction method. With rapid …
Evaluation of clustering and topic modeling methods over health-related tweets and emails
Background Internet provides different tools for communicating with patients, such as social
media (eg, Twitter) and email platforms. These platforms provided new data sources to shed …
media (eg, Twitter) and email platforms. These platforms provided new data sources to shed …
Clusterllm: Large language models as a guide for text clustering
We introduce ClusterLLM, a novel text clustering framework that leverages feedback from an
instruction-tuned large language model, such as ChatGPT. Compared with traditional …
instruction-tuned large language model, such as ChatGPT. Compared with traditional …
Synergizing large language models and pre-trained smaller models for conversational intent discovery
Abstract In Conversational Intent Discovery (CID), Small Language Models (SLMs) struggle
with overfitting to familiar intents and fail to label newly discovered ones. This issue stems …
with overfitting to familiar intents and fail to label newly discovered ones. This issue stems …
Joint unsupervised contrastive learning and robust GMM for text clustering
C Hu, T Wu, S Liu, C Liu, T Ma, F Yang - Information Processing & …, 2024 - Elsevier
Text clustering aims to organize a vast collection of documents into meaningful and coherent
clusters, thereby facilitating the extraction of valuable insights. While current frameworks for …
clusters, thereby facilitating the extraction of valuable insights. While current frameworks for …
Enhancement of short text clustering by iterative classification
Short text clustering is a challenging task due to the lack of signal contained in short texts. In
this work, we propose iterative classification as a method to boost the clustering quality of …
this work, we propose iterative classification as a method to boost the clustering quality of …
An online semantic-enhanced Dirichlet model for short text stream clustering
Clustering short text streams is a challenging task due to its unique properties: infinite
length, sparse data representation and cluster evolution. Existing approaches often exploit …
length, sparse data representation and cluster evolution. Existing approaches often exploit …