Fair algorithms for clustering

S Bera, D Chakrabarty, N Flores… - Advances in Neural …, 2019 - proceedings.neurips.cc
We study the problem of finding low-cost {\em fair clusterings} in data where each data point
may belong to many protected groups. Our work significantly generalizes the seminal work …

Better Guarantees for -Means and Euclidean -Median by Primal-Dual Algorithms

S Ahmadian, A Norouzi-Fard, O Svensson… - SIAM Journal on …, 2019 - SIAM
Clustering is a classic topic in optimization with k-means being one of the most fundamental
such problems. In the absence of any restrictions on the input, the best-known algorithm for k …

Fair clustering via equitable group representations

M Abbasi, A Bhaskara… - Proceedings of the 2021 …, 2021 - dl.acm.org
What does it mean for a clustering to be fair? One popular approach seeks to ensure that
each cluster contains groups in (roughly) the same proportion in which they exist in the …

Fast clustering using MapReduce

A Ene, S Im, B Moseley - Proceedings of the 17th ACM SIGKDD …, 2011 - dl.acm.org
Clustering problems have numerous applications and are becoming more challenging as
the size of the data increases. In this paper, we consider designing clustering algorithms that …

Local search methods for k-means with outliers

S Gupta, R Kumar, K Lu, B Moseley… - Proceedings of the VLDB …, 2017 - dl.acm.org
We study the problem of k-means clustering in the presence of outliers. The goal is to cluster
a set of data points to minimize the variance of the points assigned to the same cluster, with …

Local Search Yields a PTAS for -Means in Doubling Metrics

Z Friggstad, M Rezapour, MR Salavatipour - SIAM Journal on Computing, 2019 - SIAM
The most well-known and ubiquitous clustering problem encountered in nearly every branch
of science is undoubtedly k-means: given a set of data points and a parameter k, select k …

Individual fairness for k-clustering

S Mahabadi, A Vakilian - International conference on …, 2020 - proceedings.mlr.press
We give a local search based algorithm for $ k $-median and $ k $-means (and more
generally for any $ k $-clustering with $\ell_p $ norm cost function) from the perspective of …

Improved spectral-norm bounds for clustering

P Awasthi, O Sheffet - International Workshop on Approximation …, 2012 - Springer
Aiming to unify known results about clustering mixtures of distributions under separation
conditions, Kumar and Kannan [1] introduced a deterministic condition for clustering …

Breaching the 2 LMP Approximation Barrier for Facility Location with Applications to k-Median

V Cohen-Addad Viallat, F Grandoni, E Lee… - Proceedings of the 2023 …, 2023 - SIAM
The Uncapacitated Facility Location (UFL) problem is one of the most fundamental
clustering problems: Given a set of clients C and a set of facilities F in a metric space (C∪ F …

A local search algorithm for k-means with outliers

Z Zhang, Q Feng, J Huang, Y Guo, J Xu, J Wang - Neurocomputing, 2021 - Elsevier
Abstract k-Means is a well-studied clustering problem that finds applications in many fields
related to unsupervised learning. It is known that k-means clustering is highly sensitive to the …