Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Improved Coresets for Euclidean -Means
Given a set of $ n $ points in $ d $ dimensions, the Euclidean $ k $-means problem (resp.
Euclidean $ k $-median) consists of finding $ k $ centers such that the sum of squared …
Euclidean $ k $-median) consists of finding $ k $ centers such that the sum of squared …
Towards optimal lower bounds for k-median and k-means coresets
The (k, z)-clustering problem consists of finding a set of k points called centers, such that the
sum of distances raised to the power of z of every data point to its closest center is …
sum of distances raised to the power of z of every data point to its closest center is …
The power of uniform sampling for coresets
Motivated by practical generalizations of the classic k-median and k-means objectives, such
as clustering with size constraints, fair clustering, and Wasserstein barycenter, we introduce …
as clustering with size constraints, fair clustering, and Wasserstein barycenter, we introduce …
Coverage-centric coreset selection for high pruning rates
One-shot coreset selection aims to select a representative subset of the training data, given
a pruning rate, that can later be used to train future models while retaining high accuracy …
a pruning rate, that can later be used to train future models while retaining high accuracy …
Improved coresets and sublinear algorithms for power means in euclidean spaces
V Cohen-Addad, D Saulpic… - Advances in Neural …, 2021 - proceedings.neurips.cc
In this paper, we consider the problem of finding high dimensional power means: given a set
$ A $ of $ n $ points in $\R^ d $, find the point $ m $ that minimizes the sum of Euclidean …
$ A $ of $ n $ points in $\R^ d $, find the point $ m $ that minimizes the sum of Euclidean …
New subset selection algorithms for low rank approximation: Offline and online
Subset selection for the rank k approximation of an n× d matrix A offers improvements in the
interpretability of matrices, as well as a variety of computational savings. This problem is well …
interpretability of matrices, as well as a variety of computational savings. This problem is well …
Coresets for Vertical Federated Learning: Regularized Linear Regression and -Means Clustering
Vertical federated learning (VFL), where data features are stored in multiple parties
distributively, is an important area in machine learning. However, the communication …
distributively, is an important area in machine learning. However, the communication …
Streaming Euclidean k-median and k-means with o (log n) Space
We consider the classic Euclidean k-median and k-means objective on data streams, where
the goal is to provide a (1+ε)-approximation to the optimal k-median or k-means solution …
the goal is to provide a (1+ε)-approximation to the optimal k-median or k-means solution …
Tight bounds for volumetric spanners and applications
Given a set of points of interest, a volumetric spanner is a subset of the points using which all
the points can be expressed using" small" coefficients (measured in an appropriate norm) …
the points can be expressed using" small" coefficients (measured in an appropriate norm) …
Near-Optimal -Clustering in the Sliding Window Model
Clustering is an important technique for identifying structural information in large-scale data
analysis, where the underlying dataset may be too large to store. In many applications …
analysis, where the underlying dataset may be too large to store. In many applications …