Diffusion models in vision: A survey

FA Croitoru, V Hondru, RT Ionescu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Denoising diffusion models represent a recent emerging topic in computer vision,
demonstrating remarkable results in the area of generative modeling. A diffusion model is a …

Diffusion models: A comprehensive survey of methods and applications

L Yang, Z Zhang, Y Song, S Hong, R Xu, Y Zhao… - ACM Computing …, 2023 - dl.acm.org
Diffusion models have emerged as a powerful new family of deep generative models with
record-breaking performance in many applications, including image synthesis, video …

Align your latents: High-resolution video synthesis with latent diffusion models

A Blattmann, R Rombach, H Ling… - Proceedings of the …, 2023 - openaccess.thecvf.com
Latent Diffusion Models (LDMs) enable high-quality image synthesis while avoiding
excessive compute demands by training a diffusion model in a compressed lower …

Imagic: Text-based real image editing with diffusion models

B Kawar, S Zada, O Lang, O Tov… - Proceedings of the …, 2023 - openaccess.thecvf.com
Text-conditioned image editing has recently attracted considerable interest. However, most
methods are currently limited to one of the following: specific editing types (e.g., object …

DiffBIR: Toward blind image restoration with generative diffusion prior

X Lin, J He, Z Chen, Z Lyu, B Dai, F Yu, Y Qiao… - … on Computer Vision, 2024 - Springer
We present DiffBIR, a general restoration pipeline that could handle different blind image
restoration tasks in a unified framework. DiffBIR decouples blind image restoration problem …

eDiff-I: Text-to-image diffusion models with an ensemble of expert denoisers

Y Balaji, S Nah, X Huang, A Vahdat, J Song… - arXiv preprint arXiv …, 2022 - arxiv.org
Large-scale diffusion-based generative models have led to breakthroughs in text-
conditioned high-resolution image synthesis. Starting from random noise, such text-to-image …

Consistency models

Y Song, P Dhariwal, M Chen, I Sutskever - arXiv preprint arXiv:2303.01469, 2023 - arxiv.org
Diffusion models have significantly advanced the fields of image, audio, and video
generation, but they depend on an iterative sampling process that causes slow generation …

A survey on video diffusion models

Z Xing, Q Feng, H Chen, Q Dai, H Hu, H Xu… - ACM Computing …, 2024 - dl.acm.org
The recent wave of AI-generated content (AIGC) has witnessed substantial success in
computer vision, with the diffusion model playing a crucial role in this achievement. Due to …

PhysDiff: Physics-guided human motion diffusion model

Y Yuan, J Song, U Iqbal, A Vahdat… - Proceedings of the …, 2023 - openaccess.thecvf.com
Denoising diffusion models hold great promise for generating diverse and realistic human
motions. However, existing motion diffusion models largely disregard the laws of physics in …

ResShift: Efficient diffusion model for image super-resolution by residual shifting

Z Yue, J Wang, CC Loy - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Diffusion-based image super-resolution (SR) methods are mainly limited by the low
inference speed due to the requirements of hundreds or even thousands of sampling steps …