Solving -center Clustering (with Outliers) in MapReduce and Streaming, almost as Accurately as Sequentially
Center-based clustering is a fundamental primitive for data analysis and becomes very
challenging for large datasets. In this paper, we focus on the popular $ k $-center variant …
challenging for large datasets. In this paper, we focus on the popular $ k $-center variant …
Diversity maximization in the presence of outliers
D Amagata - Proceedings of the AAAI conference on artificial …, 2023 - ojs.aaai.org
Given a set X of n points in a metric space, the problem of diversity maximization is to extract
a set S of k points from X so that the diversity of S is maximized. This problem is essential in …
a set S of k points from X so that the diversity of S is maximized. This problem is essential in …
Diverse data selection under fairness constraints
Diversity is an important principle in data selection and summarization, facility location, and
recommendation systems. Our work focuses on maximizing diversity in data selection, while …
recommendation systems. Our work focuses on maximizing diversity in data selection, while …
Local search for max-sum diversification
We provide simple and fast polynomial-time approximation schemes (PTASs) for several
variants of the max-sum diversification problem which, in its most basic form, is as follows …
variants of the max-sum diversification problem which, in its most basic form, is as follows …
Fair max–min diversity maximization in streaming and sliding-window models
Diversity maximization is a fundamental problem with broad applications in data
summarization, web search, and recommender systems. Given a set X of n elements, the …
summarization, web search, and recommender systems. Given a set X of n elements, the …
Improved sliding window algorithms for clustering and coverage via bucketing-based sketches
Streaming computation plays an important role in large-scale data analysis. The sliding
window model is a model of streaming computation which also captures the recency of the …
window model is a model of streaming computation which also captures the recency of the …
Composable core-sets for determinant maximization problems via spectral spanners
We study a generalization of classical combinatorial graph spanners to the spectral setting.
Given a set of vectors V⊆ ℝ d, we say a set U⊆ V is an α-spectral k spanner, for k≤ d, if for …
Given a set of vectors V⊆ ℝ d, we say a set U⊆ V is an α-spectral k spanner, for k≤ d, if for …
Improved approximation and scalability for fair max-min diversification
Given an $ n $-point metric space $(\mathcal {X}, d) $ where each point belongs to one of $
m= O (1) $ different categories or groups and a set of integers $ k_1,\ldots, k_m $, the fair …
m= O (1) $ different categories or groups and a set of integers $ k_1,\ldots, k_m $, the fair …
Composable core-sets for determinant maximization: A simple near-optimal algorithm
Abstract “Composable core-sets” are an efficient framework for solving optimization
problems in massive data models. In this work, we consider efficient construction of …
problems in massive data models. In this work, we consider efficient construction of …
Streaming algorithms for diversity maximization with fairness constraints
Diversity maximization is a fundamental problem with wide applications in data
summarization, web search, and recommender systems. Given a set X of n elements, it asks …
summarization, web search, and recommender systems. Given a set X of n elements, it asks …