Diffusion models, image super-resolution, and everything: A survey

BB Moser, AS Shanbhag, F Raue… - … on Neural Networks …, 2024 - ieeexplore.ieee.org
Diffusion models (DMs) have disrupted the image super-resolution (SR) field and further
closed the gap between image quality and human perceptual preferences. They are easy to …

A survey of deep face restoration: Denoise, super-resolution, deblur, artifact removal

T Wang, K Zhang, X Chen, W Luo, J Deng, T Lu… - arxiv preprint arxiv …, 2022 - arxiv.org
Face Restoration (FR) aims to restore High-Quality (HQ) faces from Low-Quality (LQ) input
images, which is a domain-specific image restoration problem in the low-level computer …

Lavie: High-quality video generation with cascaded latent diffusion models

Y Wang, X Chen, X Ma, S Zhou, Z Huang… - International Journal of …, 2024 - Springer
This work aims to learn a high-quality text-to-video (T2V) generative model by leveraging a
pre-trained text-to-image (T2I) model as a basis. It is a highly desirable yet challenging task …

Exploiting diffusion prior for real-world image super-resolution

J Wang, Z Yue, S Zhou, KCK Chan, CC Loy - International Journal of …, 2024 - Springer
We present a novel approach to leverage prior knowledge encapsulated in pre-trained text-
to-image diffusion models for blind super-resolution. Specifically, by employing our time …

Diffbir: Toward blind image restoration with generative diffusion prior

X Lin, J He, Z Chen, Z Lyu, B Dai, F Yu, Y Qiao… - … on Computer Vision, 2024 - Springer
We present DiffBIR, a general restoration pipeline that could handle different blind image
restoration tasks in a unified framework. DiffBIR decouples blind image restoration problem …

Pixel-aware stable diffusion for realistic image super-resolution and personalized stylization

T Yang, R Wu, P Ren, X **e, L Zhang - European Conference on Computer …, 2024 - Springer
Diffusion models have demonstrated impressive performance in various image generation,
editing, enhancement and translation tasks. In particular, the pre-trained text-to-image stable …

Codetalker: Speech-driven 3d facial animation with discrete motion prior

J **ng, M **a, Y Zhang, X Cun… - Proceedings of the …, 2023 - openaccess.thecvf.com
Speech-driven 3D facial animation has been widely studied, yet there is still a gap to
achieving realism and vividness due to the highly ill-posed nature and scarcity of audio …

Ridcp: Revitalizing real image dehazing via high-quality codebook priors

RQ Wu, ZP Duan, CL Guo… - Proceedings of the …, 2023 - openaccess.thecvf.com
Existing dehazing approaches struggle to process real-world hazy images owing to the lack
of paired real data and robust priors. In this work, we present a new paradigm for real image …

Dr2: Diffusion-based robust degradation remover for blind face restoration

Z Wang, Z Zhang, X Zhang, H Zheng… - Proceedings of the …, 2023 - openaccess.thecvf.com
Blind face restoration usually synthesizes degraded low-quality data with a pre-defined
degradation model for training, while more complex cases could happen in the real world …

Difface: Blind face restoration with diffused error contraction

Z Yue, CC Loy - IEEE Transactions on Pattern Analysis and …, 2024 - ieeexplore.ieee.org
While deep learning-based methods for blind face restoration have achieved
unprecedented success, they still suffer from two major limitations. First, most of them …