Sinkhorn distances: Lightspeed computation of optimal transport

M Cuturi - Advances in neural information processing …, 2013 - proceedings.neurips.cc
Optimal transportation distances are a fundamental family of parameterized distances for
histograms in the probability simplex. Despite their appealing theoretical properties …

Parallel algorithms for geometric graph problems

A Andoni, A Nikolov, K Onak… - Proceedings of the forty …, 2014 - dl.acm.org
We give algorithms for geometric graph problems in the modern parallel models such as
MapReduce. For example, for the Minimum Spanning Tree (MST) problem over a set of …

On invariance and selectivity in representation learning

F Anselmi, L Rosasco, T Poggio - Information and Inference: A …, 2016 - academic.oup.com
We study the problem of learning from data representations that are invariant to
transformations, and at the same time selective, in the sense that two points have the same …

Nearest neighbor search: the old, the new, and the impossible

A Andoni - 2009 - dspace.mit.edu
Over the last decade, an immense amount of data has become available. From collections of
photos, to genetic data, and to network traffic statistics, modern technologies and cheap …

Streaming algorithms via precision sampling

A Andoni, R Krauthgamer… - 2011 IEEE 52nd Annual …, 2011 - ieeexplore.ieee.org
A technique introduced by Indyk and Woodruff (STOC 2005) has inspired several recent
advances in data-stream algorithms. We show that a number of these results follow eas-ily …

[PDF][PDF] Learning-augmented data stream algorithms

T Jiang, Y Li, H Lin, Y Ruan, DP Woodruff - ICLR, 2020 - par.nsf.gov
The data stream model is a fundamental model for processing massive data sets with limited
memory and fast processing time. Recently Hsu et al.(2019) incorporated machine learning …

Subspace embeddings for the L1-norm with applications

C Sohler, DP Woodruff - Proceedings of the forty-third annual ACM …, 2011 - dl.acm.org
We show there is a distribution over linear map**s R: l1n-> l1O (d log d), such that with
arbitrarily large constant probability, for any fixed d-dimensional subspace L, for all x∈ L we …

Streaming euclidean mst to a constant factor

X Chen, V Cohen-Addad, R Jayaram, A Levi… - Proceedings of the 55th …, 2023 - dl.acm.org
We study streaming algorithms for the fundamental geometric problem of computing the cost
of the Euclidean Minimum Spanning Tree (MST) on an n-point set X⊂ ℝ d. In the streaming …

New streaming algorithms for high dimensional EMD and MST

X Chen, R Jayaram, A Levi, E Waingarten - Proceedings of the 54th …, 2022 - dl.acm.org
We study streaming algorithms for two fundamental geometric problems: computing the cost
of a Minimum Spanning Tree (MST) of an n-point set X⊂{1, 2,…, Δ} d, and computing the …

Sketching sparse matrices, covariances, and graphs via tensor products

G Dasarathy, P Shah, BN Bhaskar… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
This paper considers the problem of recovering an unknown sparse p× p matrix X from an
m× m matrix Y= AXB T, where A and B are known m× p matrices with m≪ p. The main result …