Near-linear time approximation algorithms for optimal transport via Sinkhorn iteration

J Altschuler, J Niles-Weed… - Advances in neural …, 2017 - proceedings.neurips.cc
Computing optimal transport distances such as the earth mover's distance is a fundamental
problem in machine learning, statistics, and computer vision. Despite the recent introduction …

Transformers, parallel computation, and logarithmic depth

C Sanford, D Hsu, M Telgarsky - arXiv preprint arXiv:2402.09268, 2024 - arxiv.org
We show that a constant number of self-attention layers can efficiently simulate, and be
simulated by, a constant number of communication rounds of Massively Parallel …

Sublinear algorithms for (Δ+1) vertex coloring

S Assadi, Y Chen, S Khanna - Proceedings of the Thirtieth Annual ACM-SIAM …, 2019 - SIAM
Any graph with maximum degree Δ admits a proper vertex coloring with Δ+1 colors that can
be found via a simple sequential greedy algorithm in linear time and space. But can one find …

Affinity clustering: Hierarchical clustering at scale

MH Bateni, S Behnezhad… - Advances in …, 2017 - proceedings.neurips.cc
Graph clustering is a fundamental task in many data-mining and machine-learning
pipelines. In particular, identifying a good hierarchical structure is at the same time a …

Improved massively parallel computation algorithms for MIS, matching, and vertex cover

M Ghaffari, T Gouleakis, C Konrad, S Mitrović… - Proceedings of the …, 2018 - dl.acm.org
We present O(log log n)-round algorithms in the Massively Parallel Computation (MPC)
model, with Õ(n) memory per machine, that compute a maximal independent set, a 1+ε …

Coresets meet EDCS: algorithms for matching and vertex cover on massive graphs

S Assadi, MH Bateni, A Bernstein, V Mirrokni… - Proceedings of the …, 2019 - SIAM
There is a rapidly growing need for scalable algorithms that solve classical graph problems,
such as maximum matching and minimum vertex cover, on massive graphs. For massive …

Parallel graph connectivity in log diameter rounds

A Andoni, Z Song, C Stein, Z Wang… - 2018 IEEE 59th Annual …, 2018 - ieeexplore.ieee.org
Many modern parallel systems, such as MapReduce, Hadoop and Spark, can be modeled
well by the MPC model. The MPC model captures well coarse-grained computation on large …

The complexity of (Δ+1) coloring in congested clique, massively parallel computation, and centralized local computation

YJ Chang, M Fischer, M Ghaffari, J Uitto… - Proceedings of the 2019 …, 2019 - dl.acm.org
In this paper, we present new randomized algorithms that improve the complexity of the
classic (Δ+1)-coloring problem, and its generalization (Δ+1)-list-coloring, in three well …

Randomized composable core-sets for distributed submodular maximization

V Mirrokni, M Zadimoghaddam - … of the forty-seventh annual ACM …, 2015 - dl.acm.org
An effective technique for solving optimization problems over massive data sets is to
partition the data into smaller pieces, solve the problem on each piece and compute a …

Parallel approximate undirected shortest paths via low hop emulators

A Andoni, C Stein, P Zhong - Proceedings of the 52nd Annual ACM …, 2020 - dl.acm.org
We present a (1+ε)-approximate parallel algorithm for computing shortest paths in
undirected graphs, achieving poly(log n) depth and m poly(log n) work for n-nodes m …