Drag your gan: Interactive point-based manipulation on the generative image manifold

X Pan, A Tewari, T Leimkühler, L Liu, A Meka… - ACM SIGGRAPH 2023 …, 2023 - dl.acm.org
Synthesizing visual content that meets users' needs often requires flexible and precise
controllability of the pose, shape, expression, and layout of the generated objects. Existing …

Survey on leveraging pre-trained generative adversarial networks for image editing and restoration

M Liu, Y Wei, X Wu, W Zuo, L Zhang - Science China Information Sciences, 2023 - Springer
Generative adversarial networks (GANs) have drawn enormous attention due to their simple
yet effective training mechanism and superior image generation quality. With the ability to …

Stylegan-v: A continuous video generator with the price, image quality and perks of stylegan2

I Skorokhodov, S Tulyakov… - Proceedings of the …, 2022 - openaccess.thecvf.com
Videos show continuous events, yet most--if not all--video synthesis frameworks treat them
discretely in time. In this work, we think of videos of what they should be--time-continuous …

Epigraf: Rethinking training of 3d gans

I Skorokhodov, S Tulyakov, Y Wang… - Advances in Neural …, 2022 - proceedings.neurips.cc
A recent trend in generative modeling is building 3D-aware generators from 2D image
collections. To induce the 3D bias, such models typically rely on volumetric rendering, which …

Iti-gen: Inclusive text-to-image generation

C Zhang, X Chen, S Chai, CH Wu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Text-to-image generative models often reflect the biases of the training data, leading to
unequal representations of underrepresented groups. This study investigates inclusive text …

Zigma: A dit-style zigzag mamba diffusion model

VT Hu, SA Baumann, M Gui, O Grebenkova… - … on Computer Vision, 2024 - Springer
The diffusion model has long been plagued by scalability and quadratic complexity issues,
especially within transformer-based structures. In this study, we aim to leverage the long …

Diffcollage: Parallel generation of large content with diffusion models

Q Zhang, J Song, X Huang, Y Chen… - 2023 IEEE/CVF …, 2023 - ieeexplore.ieee.org
We present DiffCollage, a compositional diffusion model that can generate large content by
leveraging diffusion models trained on generating pieces of the large content. Our approach …

Frido: Feature pyramid diffusion for complex scene image synthesis

WC Fan, YC Chen, DD Chen, Y Cheng… - Proceedings of the …, 2023 - ojs.aaai.org
Diffusion models (DMs) have shown great potential for high-quality image synthesis.
However, when it comes to producing images with complex scenes, how to properly …

Infinicity: Infinite-scale city synthesis

CH Lin, HY Lee, W Menapace, M Chai… - Proceedings of the …, 2023 - openaccess.thecvf.com
Toward infinite-scale 3D city synthesis, we propose a novel framework, InfiniCity, which
constructs and renders an unconstrainedly large and 3D-grounded environment from …

Hierarchical patch diffusion models for high-resolution video generation

I Skorokhodov, W Menapace… - Proceedings of the …, 2024 - openaccess.thecvf.com
Diffusion models have demonstrated remarkable performance in image and video synthesis.
However scaling them to high-resolution inputs is challenging and requires restructuring the …