Comprehensive survey on hierarchical clustering algorithms and the recent developments

X Ran, Y **, Y Lu, X Wang, Z Lu - Artificial Intelligence Review, 2023 - Springer
Data clustering is a commonly used data processing technique in many fields, which divides
objects into different clusters in terms of some similarity measure between data points …

A comprehensive survey of clustering algorithms: State-of-the-art machine learning applications, taxonomy, challenges, and future research prospects

AE Ezugwu, AM Ikotun, OO Oyelade… - … Applications of Artificial …, 2022 - Elsevier
Clustering is an essential tool in data mining research and applications. It is the subject of
active research in many fields of study, such as computer science, data science, statistics …

The effects of data quality on machine learning performance

L Budach, M Feuerpfeil, N Ihde, A Nathansen… - arxiv preprint arxiv …, 2022 - arxiv.org
Modern artificial intelligence (AI) applications require large quantities of training and test
data. This need creates critical challenges not only concerning the availability of such data …

Large language models enable few-shot clustering

V Viswanathan, K Gashteovski… - Transactions of the …, 2024 - direct.mit.edu
Unlike traditional unsupervised clustering, semi-supervised clustering allows users to
provide meaningful structure to the data, which helps the clustering algorithm to match the …

Deep learning networks for stock market analysis and prediction: Methodology, data representations, and case studies

E Chong, C Han, FC Park - Expert Systems with Applications, 2017 - Elsevier
We offer a systematic analysis of the use of deep learning networks for stock market analysis
and prediction. Its ability to extract features from a large set of raw data without relying on …

Clustering huge protein sequence sets in linear time

M Steinegger, J Söding - Nature communications, 2018 - nature.com
Metagenomic datasets contain billions of protein sequences that could greatly enhance
large-scale functional annotation and structure prediction. Utilizing this enormous resource …

Algorithms for hierarchical clustering: an overview, II

F Murtagh, P Contreras - Wiley Interdisciplinary Reviews: Data …, 2017 - Wiley Online Library
We survey agglomerative hierarchical clustering algorithms and discuss efficient
implementations that are available in R and other software environments. We look at …

Securing the smart grid: A comprehensive compilation of intrusion detection and prevention systems

PI Radoglou-Grammatikis, PG Sarigiannidis - Ieee Access, 2019 - ieeexplore.ieee.org
The smart grid (SG) paradigm is the next technological leap of the conventional electrical
grid, contributing to the protection of the physical environment and providing multiple …

Spot the conversation: speaker diarisation in the wild

JS Chung, J Huh, A Nagrani, T Afouras… - arxiv preprint arxiv …, 2020 - arxiv.org
The goal of this paper is speaker diarisation of videos collected'in the wild'. We make three
key contributions. First, we propose an automatic audio-visual diarisation method for …

Classification and clustering of arguments with contextualized word embeddings

N Reimers, B Schiller, T Beck, J Daxenberger… - arxiv preprint arxiv …, 2019 - arxiv.org
We experiment with two recent contextualized word embedding methods (ELMo and BERT)
in the context of open-domain argument search. For the first time, we show how to leverage …