Diffusion models in vision: A survey

FA Croitoru, V Hondru, RT Ionescu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Denoising diffusion models represent a recent emerging topic in computer vision,
demonstrating remarkable results in the area of generative modeling. A diffusion model is a …

Diffusion models: A comprehensive survey of methods and applications

L Yang, Z Zhang, Y Song, S Hong, R Xu, Y Zhao… - ACM Computing …, 2023 - dl.acm.org
Diffusion models have emerged as a powerful new family of deep generative models with
record-breaking performance in many applications, including image synthesis, video …

Diffusion policy: Visuomotor policy learning via action diffusion

C Chi, Z Xu, S Feng, E Cousineau… - … Journal of Robotics …, 2023 - journals.sagepub.com
This paper introduces Diffusion Policy, a new way of generating robot behavior by
representing a robot's visuomotor policy as a conditional denoising diffusion process. We …

SDXL: Improving latent diffusion models for high-resolution image synthesis

D Podell, Z English, K Lacey, A Blattmann… - arXiv preprint arXiv …, 2023 - arxiv.org
We present SDXL, a latent diffusion model for text-to-image synthesis. Compared to
previous versions of Stable Diffusion, SDXL leverages a three times larger UNet backbone …

Accurate structure prediction of biomolecular interactions with AlphaFold 3

J Abramson, J Adler, J Dunger, R Evans, T Green… - Nature, 2024 - nature.com
The introduction of AlphaFold 2 has spurred a revolution in modelling the structure of
proteins and their interactions, enabling a huge range of applications in protein modelling …

Structure and content-guided video synthesis with diffusion models

P Esser, J Chiu, P Atighehchian… - Proceedings of the …, 2023 - openaccess.thecvf.com
Text-guided generative diffusion models unlock powerful image creation and editing tools.
Recent approaches that edit the content of footage while retaining structure require …

Scalable diffusion models with transformers

W Peebles, S Xie - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
We explore a new class of diffusion models based on the transformer architecture. We train
latent diffusion models of images, replacing the commonly-used U-Net backbone with a …

Emergent correspondence from image diffusion

L Tang, M Jia, Q Wang, CP Phoo… - Advances in Neural …, 2023 - proceedings.neurips.cc
Finding correspondences between images is a fundamental problem in computer vision. In
this paper, we show that correspondence emerges in image diffusion models without any …

Score Jacobian chaining: Lifting pretrained 2D diffusion models for 3D generation

H Wang, X Du, J Li, RA Yeh… - Proceedings of the …, 2023 - openaccess.thecvf.com
A diffusion model learns to predict a vector field of gradients. We propose to apply chain rule
on the learned gradients, and back-propagate the score of a diffusion model through the …

Shap-E: Generating conditional 3D implicit functions

H Jun, A Nichol - arXiv preprint arXiv:2305.02463, 2023 - arxiv.org
We present Shap-E, a conditional generative model for 3D assets. Unlike recent work on 3D
generative models which produce a single output representation, Shap-E directly generates …