Preserve your own correlation: A noise prior for video diffusion models

S Ge, S Nah, G Liu, T Poon, A Tao… - Proceedings of the …, 2023 - openaccess.thecvf.com
Despite tremendous progress in generating high-quality images using diffusion models,
synthesizing a sequence of animated frames that are both photorealistic and temporally …

Grounded text-to-image synthesis with attention refocusing

Q Phung, S Ge, JB Huang - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Driven by the scalable diffusion models trained on large-scale datasets text-to-image
synthesis methods have shown compelling results. However these models still fail to …

Diffusion model-based image editing: A survey

Y Huang, J Huang, Y Liu, M Yan, J Lv, J Liu… - arxiv preprint arxiv …, 2024 - arxiv.org
Denoising diffusion models have emerged as a powerful tool for various image generation
and editing tasks, facilitating the synthesis of visual content in an unconditional or input …

Renoise: Real image inversion through iterative noising

D Garibi, O Patashnik, A Voynov… - … on Computer Vision, 2024 - Springer
Recent advancements in text-guided diffusion models have unlocked powerful image
manipulation capabilities. However, applying these methods to real images necessitates the …

Cross-image attention for zero-shot appearance transfer

Y Alaluf, D Garibi, O Patashnik… - ACM SIGGRAPH 2024 …, 2024 - dl.acm.org
Recent advancements in text-to-image generative models have demonstrated a remarkable
ability to capture a deep semantic understanding of images. In this work, we leverage this …

Freecontrol: Training-free spatial control of any text-to-image diffusion model with any condition

S Mo, F Mu, KH Lin, Y Liu, B Guan… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recent approaches such as ControlNet offer users fine-grained spatial control over text-to-
image (T2I) diffusion models. However auxiliary modules have to be trained for each spatial …

Boosting consistency in story visualization with rich-contextual conditional diffusion models

F Shen, H Ye, S Liu, J Zhang, C Wang, X Han… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent research showcases the considerable potential of conditional diffusion models for
generating consistent stories. However, current methods, which predominantly generate …

Portraitbooth: A versatile portrait model for fast identity-preserved personalization

X Peng, J Zhu, B Jiang, Y Tai, D Luo… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recent advancements in personalized image generation using diffusion models have been
noteworthy. However existing methods suffer from inefficiencies due to the requirement for …

It's All About Your Sketch: Democratising Sketch Control in Diffusion Models

S Koley, AK Bhunia, D Sekhri, A Sain… - Proceedings of the …, 2024 - openaccess.thecvf.com
This paper unravels the potential of sketches for diffusion models addressing the deceptive
promise of direct sketch control in generative AI. We importantly democratise the process …

Bridging different language models and generative vision models for text-to-image generation

S Zhao, S Hao, B Zi, H Xu, KYK Wong - European Conference on …, 2024 - Springer
Text-to-image generation has made significant advancements with the introduction of text-to-
image diffusion models. These models typically consist of a language model that interprets …