Frameworks and results in distributionally robust optimization

H Rahimian, S Mehrotra - Open Journal of Mathematical Optimization, 2022 - numdam.org
The concepts of risk aversion, chance-constrained optimization, and robust optimization
have developed significantly over the last decade. The statistical learning community has …

Computational optimal transport: With applications to data science

G Peyré, M Cuturi - Foundations and Trends® in Machine …, 2019 - nowpublishers.com
Optimal transport (OT) theory can be informally described using the words of the French
mathematician Gaspard Monge (1746–1818): A worker with a shovel in hand has to move a …

Learning generative models with sinkhorn divergences

A Genevay, G Peyré, M Cuturi - International Conference on …, 2018 - proceedings.mlr.press
The ability to compare two degenerate probability distributions, that is two distributions
supported on low-dimensional manifolds in much higher-dimensional spaces, is a crucial …

Estimating individual treatment effect: generalization bounds and algorithms

U Shalit, FD Johansson… - … conference on machine …, 2017 - proceedings.mlr.press
There is intense interest in applying machine learning to problems of causal inference in
fields such as healthcare, economics and education. In particular, individual-level causal …

Kernel mean embedding of distributions: A review and beyond

K Muandet, K Fukumizu… - … and Trends® in …, 2017 - nowpublishers.com
A Hilbert space embedding of a distribution—in short, a kernel mean embedding—has
recently emerged as a powerful tool for machine learning and statistical inference. The basic …

Sample complexity of sinkhorn divergences

A Genevay, L Chizat, F Bach… - The 22nd …, 2019 - proceedings.mlr.press
Optimal transport (OT) and maximum mean discrepancies (MMD) are now routinely used in
machine learning to compare probability measures. We focus in this paper on Sinkhorn …

A mathematical perspective on transformers

B Geshkovski, C Letrouit, Y Polyanskiy… - arxiv preprint arxiv …, 2023 - arxiv.org
Transformers play a central role in the inner workings of large language models. We
develop a mathematical framework for analyzing Transformers based on their interpretation …

Stochastic optimization for large-scale optimal transport

A Genevay, M Cuturi, G Peyré… - Advances in neural …, 2016 - proceedings.neurips.cc
Optimal transport (OT) defines a powerful framework to compare probability distributions in a
geometrically faithful way. However, the practical impact of OT is still limited because of its …

Unbalanced optimal transport, from theory to numerics

T Séjourné, G Peyré, FX Vialard - Handbook of Numerical Analysis, 2023 - Elsevier
Optimal Transport (OT) has recently emerged as a central tool in data sciences to compare
in a geometrically faithful way point clouds and more generally probability distributions. The …

A unified recipe for deriving (time-uniform) PAC-Bayes bounds

B Chugg, H Wang, A Ramdas - Journal of Machine Learning Research, 2023 - jmlr.org
We present a unified framework for deriving PAC-Bayesian generalization bounds. Unlike
most previous literature on this topic, our bounds are anytime-valid (ie, time-uniform) …