Maskgit: Masked generative image transformer

H Chang, H Zhang, L Jiang, C Liu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Generative transformers have experienced rapid popularity growth in the computer vision
community in synthesizing high-fidelity and high-resolution images. The best generative …

Palette: Image-to-image diffusion models

C Saharia, W Chan, H Chang, C Lee, J Ho… - ACM SIGGRAPH 2022 …, 2022 - dl.acm.org
This paper develops a unified framework for image-to-image translation based on
conditional diffusion models and evaluates this framework on four challenging image-to …

Gan inversion: A survey

W **a, Y Zhang, Y Yang, JH Xue… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
GAN inversion aims to invert a given image back into the latent space of a pretrained GAN
model so that the image can be faithfully reconstructed from the inverted code by the …

A task is worth one word: Learning with task prompts for high-quality versatile image inpainting

J Zhuang, Y Zeng, W Liu, C Yuan, K Chen - European Conference on …, 2024 - Springer
Advancing image inpainting is challenging as it requires filling user-specified regions for
various intents, such as background filling and object synthesis. Existing approaches focus …

Regularizing generative adversarial networks under limited data

HY Tseng, L Jiang, C Liu, MH Yang… - Proceedings of the …, 2021 - openaccess.thecvf.com
Recent years have witnessed the rapid progress of generative adversarial networks (GANs).
However, the success of the GAN models hinges on a large amount of training data. This …

Avid: Any-length video inpainting with diffusion model

Z Zhang, B Wu, X Wang, Y Luo… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recent advances in diffusion models have successfully enabled text-guided image
inpainting. While it seems straightforward to extend such editing capability into the video …

Semcity: Semantic scene generation with triplane diffusion

J Lee, S Lee, C Jo, W Im, J Seon… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present" SemCity" a 3D diffusion model for semantic scene generation in real-world
outdoor environments. Most 3D diffusion models focus on generating a single object …

3davatargan: Bridging domains for personalized editable avatars

R Abdal, HY Lee, P Zhu, M Chai… - Proceedings of the …, 2023 - openaccess.thecvf.com
Modern 3D-GANs synthesize geometry and texture by training on large-scale datasets with
a consistent structure. Training such models on stylized, artistic data, with often unknown …

Any-resolution training for high-resolution image synthesis

L Chai, M Gharbi, E Shechtman, P Isola… - European conference on …, 2022 - Springer
Generative models operate at fixed resolution, even though natural images come in a variety
of sizes. As high-resolution details are downsampled away and low-resolution images are …

360dvd: Controllable panorama video generation with 360-degree video diffusion model

Q Wang, W Li, C Mou, X Cheng… - Proceedings of the …, 2024 - openaccess.thecvf.com
Panorama video recently attracts more interest in both study and application courtesy of its
immersive experience. Due to the expensive cost of capturing 360-degree panoramic videos …