Fast noise removal for k-means clustering

S Im, MM Qaem, B Moseley, X Sun… - International …, 2020 - proceedings.mlr.press
This paper considers k-means clustering in the presence of noise. It is known that k-means
clustering is highly sensitive to noise, and thus noise should be removed to obtain a quality …

Debunking free fusion myth: Online multi-view anomaly detection with disentangled product-of-experts modeling

H Wang, ZQ Cheng, J Sun, X Yang, X Wu… - Proceedings of the 31st …, 2023 - dl.acm.org
Multi-view or even multi-modal data is appealing yet challenging for real-world applications.
Detecting anomalies in multi-view data is a prominent recent research topic. However, most …

Fast algorithms for distributed k-clustering with outliers

J Huang, Q Feng, Z Huang, J Xu… - … on Machine Learning, 2023 - proceedings.mlr.press
In this paper, we study the $ k $-clustering problems with outliers in distributed setting. The
current best results for the distributed $ k $-center problem with outliers have quadratic local …

Greedy sampling for approximate clustering in the presence of outliers

A Bhaskara, S Vadgama, H Xu - Advances in Neural …, 2019 - proceedings.neurips.cc
Greedy algorithms such as adaptive sampling (k-means++) and furthest point traversal are
popular choices for clustering problems. One the one hand, they possess good theoretical …

Distributed -Clustering for Data with Heavy Noise

S Li, X Guo - Advances in Neural Information Processing …, 2018 - proceedings.neurips.cc
In this paper, we consider the $ k $-center/median/means clustering with outliers problems
(or the $(k, z) $-center/median/means problems) in the distributed setting. Most previous …

Federated matrix factorization: Algorithm design and application to data clustering

S Wang, TH Chang - IEEE Transactions on Signal Processing, 2022 - ieeexplore.ieee.org
Recent demands on data privacy have called for federated learning (FL) as a new
distributed learning paradigm in massive and heterogeneous networks. Although many FL …

SDCOR: Scalable density-based clustering for local outlier detection in massive-scale datasets

SAN Nozad, MA Haeri, G Folino - Knowledge-based systems, 2021 - Elsevier
This paper presents a batch-wise density-based clustering approach for local outlier
detection in massive-scale datasets. Unlike the well-known traditional algorithms, which …

A neighborhood weighted-based method for the detection of outliers

ZY **ong, H Long, YF Zhang, XX Wang, QQ Gao… - Applied …, 2023 - Springer
Outlierdetection is an important research direction in data mining, including fraud detection,
activity monitoring, medical research, network intrusion detection, etc. Many outlier detection …

[HTML][HTML] MapReduce algorithms for robust center-based clustering in doubling metrics

E Dandolo, A Mazzetto, A Pietracaprina… - Journal of Parallel and …, 2024 - Elsevier
Clustering is a pivotal primitive for unsupervised learning and data analysis. A popular
variant is the (k, ℓ)-clustering problem, where, given a pointset P from a metric space, one …

Demystifying model averaging for communication-efficient federated matrix factorization

S Wang, RC Suwandi, TH Chang - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Federated learning (FL) is encountered with the challenge of training a model in massive
and heterogeneous networks. Model averaging (MA) has become a popular FL paradigm …