Diffusion models in vision: A survey
Denoising diffusion models represent a recent emerging topic in computer vision,
demonstrating remarkable results in the area of generative modeling. A diffusion model is a …
Diffusion models: A comprehensive survey of methods and applications
Diffusion models have emerged as a powerful new family of deep generative models with
record-breaking performance in many applications, including image synthesis, video …
Diffusion policy: Visuomotor policy learning via action diffusion
This paper introduces Diffusion Policy, a new way of generating robot behavior by
representing a robot's visuomotor policy as a conditional denoising diffusion process. We …
SDXL: Improving latent diffusion models for high-resolution image synthesis
We present SDXL, a latent diffusion model for text-to-image synthesis. Compared to
previous versions of Stable Diffusion, SDXL leverages a three times larger UNet backbone …
Accurate structure prediction of biomolecular interactions with AlphaFold 3
The introduction of AlphaFold 2 has spurred a revolution in modelling the structure of
proteins and their interactions, enabling a huge range of applications in protein modelling …
Structure and content-guided video synthesis with diffusion models
P Esser, J Chiu, P Atighehchian… - Proceedings of the …, 2023 - openaccess.thecvf.com
Text-guided generative diffusion models unlock powerful image creation and editing tools.
Recent approaches that edit the content of footage while retaining structure require …
Scalable diffusion models with transformers
We explore a new class of diffusion models based on the transformer architecture. We train
latent diffusion models of images, replacing the commonly-used U-Net backbone with a …
Emergent correspondence from image diffusion
Finding correspondences between images is a fundamental problem in computer vision. In
this paper, we show that correspondence emerges in image diffusion models without any …
Score Jacobian chaining: Lifting pretrained 2D diffusion models for 3D generation
A diffusion model learns to predict a vector field of gradients. We propose to apply chain rule
on the learned gradients, and back-propagate the score of a diffusion model through the …
Shap-E: Generating conditional 3D implicit functions
H Jun, A Nichol - arXiv preprint arXiv:2305.02463, 2023 - arxiv.org
We present Shap-E, a conditional generative model for 3D assets. Unlike recent work on 3D
generative models which produce a single output representation, Shap-E directly generates …