An overview of diffusion models: Applications, guided generation, statistical rates and optimization

M Chen, S Mei, J Fan, M Wang - arxiv preprint arxiv:2404.07771, 2024 - arxiv.org
Diffusion models, a powerful and universal generative AI technology, have achieved
tremendous success in computer vision, audio, reinforcement learning, and computational …

Opportunities and challenges of diffusion models for generative AI

M Chen, S Mei, J Fan, M Wang - National Science Review, 2024 - academic.oup.com
Diffusion models, a powerful and universal generative artificial intelligence technology, have
achieved tremendous success and opened up new possibilities in diverse applications. In …

Sit: Exploring flow and diffusion-based generative models with scalable interpolant transformers

N Ma, M Goldstein, MS Albergo, NM Boffi… - … on Computer Vision, 2024 - Springer
Abstract We present Scalable Interpolant Transformers (SiT), a family of generative models
built on the backbone of Diffusion Transformers (DiT). The interpolant framework, which …

Stochastic interpolants: A unifying framework for flows and diffusions

MS Albergo, NM Boffi, E Vanden-Eijnden - arxiv preprint arxiv …, 2023 - arxiv.org
A class of generative models that unifies flow-based and diffusion-based methods is
introduced. These models extend the framework proposed in Albergo & Vanden-Eijnden …

Ddp: Diffusion model for dense visual prediction

Y Ji, Z Chen, E **e, L Hong, X Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
We propose a simple, efficient, yet powerful framework for dense visual predictions based
on the conditional diffusion pipeline. Our approach follows a" noise-to-map" generative …

Score approximation, estimation and distribution recovery of diffusion models on low-dimensional data

M Chen, K Huang, T Zhao… - … Conference on Machine …, 2023 - proceedings.mlr.press
Diffusion models achieve state-of-the-art performance in various generation tasks. However,
their theoretical foundations fall far behind. This paper studies score approximation …

Improved analysis of score-based generative modeling: User-friendly bounds under minimal smoothness assumptions

H Chen, H Lee, J Lu - International Conference on Machine …, 2023 - proceedings.mlr.press
We give an improved theoretical analysis of score-based generative modeling. Under a
score estimate with small $ L^ 2$ error (averaged across timesteps), we provide efficient …

Consistency trajectory models: Learning probability flow ode trajectory of diffusion

D Kim, CH Lai, WH Liao, N Murata, Y Takida… - arxiv preprint arxiv …, 2023 - arxiv.org
Consistency Models (CM)(Song et al., 2023) accelerate score-based diffusion model
sampling at the cost of sample quality but lack a natural way to trade-off quality for speed. To …

The probability flow ode is provably fast

S Chen, S Chewi, H Lee, Y Li, J Lu… - Advances in Neural …, 2023 - proceedings.neurips.cc
We provide the first polynomial-time convergence guarantees for the probabilistic flow ODE
implementation (together with a corrector step) of score-based generative modeling. Our …

Diffusion models are minimax optimal distribution estimators

K Oko, S Akiyama, T Suzuki - International Conference on …, 2023 - proceedings.mlr.press
While efficient distribution learning is no doubt behind the groundbreaking success of
diffusion modeling, its theoretical guarantees are quite limited. In this paper, we provide the …