Lavie: High-quality video generation with cascaded latent diffusion models
This work aims to learn a high-quality text-to-video (T2V) generative model by leveraging a
pre-trained text-to-image (T2I) model as a basis. It is a highly desirable yet challenging task …
pre-trained text-to-image (T2I) model as a basis. It is a highly desirable yet challenging task …
Exploiting diffusion prior for real-world image super-resolution
We present a novel approach to leverage prior knowledge encapsulated in pre-trained text-
to-image diffusion models for blind super-resolution. Specifically, by employing our time …
to-image diffusion models for blind super-resolution. Specifically, by employing our time …
Stylegan-human: A data-centric odyssey of human generation
Unconditional human image generation is an important task in vision and graphics, enabling
various applications in the creative industry. Existing studies in this field mainly focus on …
various applications in the creative industry. Existing studies in this field mainly focus on …
Glean: Generative latent bank for large-factor image super-resolution
We show that pre-trained Generative Adversarial Networks (GANs), eg, StyleGAN, can be
used as a latent bank to improve the restoration quality of large-factor image super …
used as a latent bank to improve the restoration quality of large-factor image super …
Reference-based image super-resolution with deformable attention transformer
Reference-based image super-resolution (RefSR) aims to exploit auxiliary reference (Ref)
images to super-resolve low-resolution (LR) images. Recently, RefSR has been attracting …
images to super-resolve low-resolution (LR) images. Recently, RefSR has been attracting …
Refsr-nerf: Towards high fidelity and super resolution view synthesis
Abstract We present Reference-guided Super-Resolution Neural Radiance Field (RefSR-
NeRF) that extends NeRF to super resolution and photorealistic novel view synthesis …
NeRF) that extends NeRF to super resolution and photorealistic novel view synthesis …
Hqg-net: Unpaired medical image enhancement with high-quality guidance
Unpaired medical image enhancement (UMIE) aims to transform a low-quality (LQ) medical
image into a high-quality (HQ) one without relying on paired images for training. While most …
image into a high-quality (HQ) one without relying on paired images for training. While most …
Dynast: Dynamic sparse transformer for exemplar-guided image generation
One key challenge of exemplar-guided image generation lies in establishing fine-grained
correspondences between input and guided images. Prior approaches, despite the …
correspondences between input and guided images. Prior approaches, despite the …
Real-world blind super-resolution via feature matching with implicit high-resolution priors
A key challenge of real-world image super-resolution (SR) is to recover the missing details
in low-resolution (LR) images with complex unknown degradations (\eg, downsampling …
in low-resolution (LR) images with complex unknown degradations (\eg, downsampling …
Any-resolution training for high-resolution image synthesis
Generative models operate at fixed resolution, even though natural images come in a variety
of sizes. As high-resolution details are downsampled away and low-resolution images are …
of sizes. As high-resolution details are downsampled away and low-resolution images are …