Recent advances in optimal transport for machine learning

EF Montesuma, FMN Mboula… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Recently, Optimal Transport has been proposed as a probabilistic framework in Machine
Learning for comparing and manipulating probability distributions. This is rooted in its rich …

Improving and generalizing flow-based generative models with minibatch optimal transport

A Tong, K Fatras, N Malkin, G Huguet, Y Zhang… - arxiv preprint arxiv …, 2023 - arxiv.org
Continuous normalizing flows (CNFs) are an attractive generative modeling technique, but
they have thus far been held back by limitations in their simulation-based maximum …

Multisample flow matching: Straightening flows with minibatch couplings

AA Pooladian, H Ben-Hamu, C Domingo-Enrich… - arxiv preprint arxiv …, 2023 - arxiv.org
Simulation-free methods for training continuous-time generative models construct probability
paths that go between noise distributions and individual data samples. Recent works, such …

Scalable optimal transport methods in machine learning: A contemporary survey

A Khamis, R Tsuchida, M Tarek… - IEEE transactions on …, 2024 - ieeexplore.ieee.org
Optimal Transport (OT) is a mathematical framework that first emerged in the eighteenth
century and has led to a plethora of methods for answering many theoretical and applied …

Alignment and integration of spatial transcriptomics data

R Zeira, M Land, A Strzalkowski, BJ Raphael - Nature Methods, 2022 - nature.com
Spatial transcriptomics (ST) measures mRNA expression across thousands of spots from a
tissue slice while recording the two-dimensional (2D) coordinates of each spot. We …

Unbalanced minibatch optimal transport; applications to domain adaptation

K Fatras, T Séjourné, R Flamary… - … on Machine Learning, 2021 - proceedings.mlr.press
Optimal transport distances have found many applications in machine learning for their
capacity to compare non-parametric probability distributions. Yet their algorithmic complexity …

Missing data imputation using optimal transport

B Muzellec, J Josse, C Boyer… - … Conference on Machine …, 2020 - proceedings.mlr.press
Missing data is a crucial issue when applying machine learning algorithms to real-world
datasets. Starting from the simple assumption that two batches extracted randomly from the …

[PDF][PDF] Conditional flow matching: Simulation-free dynamic optimal transport

A Tong, N Malkin, G Huguet, Y Zhang… - arxiv preprint arxiv …, 2023 - researchgate.net
Continuous normalizing flows (CNFs) are an attractive generative modeling technique, but
they have thus far been held back by limitations in their simulation-based maximum …

Simulation-free schr\" odinger bridges via score and flow matching

A Tong, N Malkin, K Fatras, L Atanackovic… - arxiv preprint arxiv …, 2023 - arxiv.org
We present simulation-free score and flow matching ([SF] $^ 2$ M), a simulation-free
objective for inferring stochastic dynamics given unpaired samples drawn from arbitrary …

Generating and imputing tabular data via diffusion and flow-based gradient-boosted trees

A Jolicoeur-Martineau, K Fatras… - International …, 2024 - proceedings.mlr.press
Tabular data is hard to acquire and is subject to missing values. This paper introduces a
novel approach for generating and imputing mixed-type (continuous and categorical) tabular …