Diffusion model-based image editing: A survey

Y Huang, J Huang, Y Liu, M Yan, J Lv, J Liu… - arxiv preprint arxiv …, 2024 - arxiv.org
Denoising diffusion models have emerged as a powerful tool for various image generation
and editing tasks, facilitating the synthesis of visual content in an unconditional or input …

LEGO: L earning EGO centric Action Frame Generation via Visual Instruction Tuning

B Lai, X Dai, L Chen, G Pang, JM Rehg… - European Conference on …, 2024 - Springer
Generating instructional images of human daily actions from an egocentric viewpoint serves
as a key step towards efficient skill transfer. In this paper, we introduce a novel problem …

BELM: Bidirectional Explicit Linear Multi-step Sampler for Exact Inversion in Diffusion Models

F Wang, H Yin, Y Dong, H Zhu, C Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org
The inversion of diffusion model sampling, which aims to find the corresponding initial noise
of a sample, plays a critical role in various tasks. Recently, several heuristic exact inversion …

Conditional Image Synthesis with Diffusion Models: A Survey

Z Zhan, D Chen, JP Mei, Z Zhao, J Chen… - arxiv preprint arxiv …, 2024 - arxiv.org
Conditional image synthesis based on user-specified requirements is a key component in
creating complex visual content. In recent years, diffusion-based generative modeling has …

Exploring the latent space of diffusion models directly through singular value decomposition

L Wang, B Gao, Y Li, Z Wang, X Yang… - arxiv preprint arxiv …, 2025 - arxiv.org
Despite the groundbreaking success of diffusion models in generating high-fidelity images,
their latent space remains relatively under-explored, even though it holds significant promise …

Addressing Attribute Leakages in Diffusion-based Image Editing without Training

S Mun, J Nam, S Cho, J Ok - arxiv preprint arxiv:2412.04715, 2024 - arxiv.org
Diffusion models have become a cornerstone in image editing, offering flexibility with
language prompts and source images. However, a key challenge is attribute leakage, where …

DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation

J Wang, C Wang, T Cao, J Huang, L ** - arxiv preprint arxiv:2403.04997, 2024 - arxiv.org
We present DiffChat, a novel method to align Large Language Models (LLMs) to" chat" with
prompt-as-input Text-to-Image Synthesis (TIS) models (eg, Stable Diffusion) for interactive …