- Academic Search

Grounded text-to-image synthesis with attention refocusing

Q Phung, S Ge, JB Huang - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com

Driven by the scalable diffusion models trained on large-scale datasets text-to-image
synthesis methods have shown compelling results. However these models still fail to …

Save Cite Cited by 84 Related articles All 3 versions Free GPT-4 View as HTML

Diffusion model-based image editing: A survey

Y Huang, J Huang, Y Liu, M Yan, J Lv, J Liu… - arxiv preprint arxiv …, 2024 - arxiv.org

Denoising diffusion models have emerged as a powerful tool for various image generation
and editing tasks, facilitating the synthesis of visual content in an unconditional or input …

Save Cite Cited by 68 Related articles All 2 versions Free GPT-4 View as HTML

Renoise: Real image inversion through iterative noising

D Garibi, O Patashnik, A Voynov… - … on Computer Vision, 2024 - Springer

Recent advancements in text-guided diffusion models have unlocked powerful image
manipulation capabilities. However, applying these methods to real images necessitates the …

Save Cite Cited by 28 Related articles All 2 versions Free GPT-4

Cross-image attention for zero-shot appearance transfer

Y Alaluf, D Garibi, O Patashnik… - ACM SIGGRAPH 2024 …, 2024 - dl.acm.org

Recent advancements in text-to-image generative models have demonstrated a remarkable
ability to capture a deep semantic understanding of images. In this work, we leverage this …

Save Cite Cited by 43 Related articles All 2 versions Free GPT-4

Freecontrol: Training-free spatial control of any text-to-image diffusion model with any condition

S Mo, F Mu, KH Lin, Y Liu, B Guan… - Proceedings of the …, 2024 - openaccess.thecvf.com

Recent approaches such as ControlNet offer users fine-grained spatial control over text-to-
image (T2I) diffusion models. However auxiliary modules have to be trained for each spatial …

Save Cite Cited by 22 Related articles All 4 versions Free GPT-4 View as HTML

Boosting consistency in story visualization with rich-contextual conditional diffusion models

F Shen, H Ye, S Liu, J Zhang, C Wang, X Han… - arxiv preprint arxiv …, 2024 - arxiv.org

Recent research showcases the considerable potential of conditional diffusion models for
generating consistent stories. However, current methods, which predominantly generate …

Save Cite Cited by 29 Related articles All 3 versions Free GPT-4 View as HTML

Portraitbooth: A versatile portrait model for fast identity-preserved personalization

X Peng, J Zhu, B Jiang, Y Tai, D Luo… - Proceedings of the …, 2024 - openaccess.thecvf.com

Recent advancements in personalized image generation using diffusion models have been
noteworthy. However existing methods suffer from inefficiencies due to the requirement for …

Save Cite Cited by 31 Related articles All 3 versions Free GPT-4 View as HTML

It's All About Your Sketch: Democratising Sketch Control in Diffusion Models

S Koley, AK Bhunia, D Sekhri, A Sain… - Proceedings of the …, 2024 - openaccess.thecvf.com

This paper unravels the potential of sketches for diffusion models addressing the deceptive
promise of direct sketch control in generative AI. We importantly democratise the process …

Save Cite Cited by 16 Related articles All 4 versions Free GPT-4 View as HTML