Diffusion model-based image editing: A survey
Denoising diffusion models have emerged as a powerful tool for various image generation
and editing tasks, facilitating the synthesis of visual content in an unconditional or input …
and editing tasks, facilitating the synthesis of visual content in an unconditional or input …
LEGO: L earning EGO centric Action Frame Generation via Visual Instruction Tuning
Generating instructional images of human daily actions from an egocentric viewpoint serves
as a key step towards efficient skill transfer. In this paper, we introduce a novel problem …
as a key step towards efficient skill transfer. In this paper, we introduce a novel problem …
BELM: Bidirectional Explicit Linear Multi-step Sampler for Exact Inversion in Diffusion Models
The inversion of diffusion model sampling, which aims to find the corresponding initial noise
of a sample, plays a critical role in various tasks. Recently, several heuristic exact inversion …
of a sample, plays a critical role in various tasks. Recently, several heuristic exact inversion …
Conditional Image Synthesis with Diffusion Models: A Survey
Conditional image synthesis based on user-specified requirements is a key component in
creating complex visual content. In recent years, diffusion-based generative modeling has …
creating complex visual content. In recent years, diffusion-based generative modeling has …
Exploring the latent space of diffusion models directly through singular value decomposition
L Wang, B Gao, Y Li, Z Wang, X Yang… - arxiv preprint arxiv …, 2025 - arxiv.org
Despite the groundbreaking success of diffusion models in generating high-fidelity images,
their latent space remains relatively under-explored, even though it holds significant promise …
their latent space remains relatively under-explored, even though it holds significant promise …
Addressing Attribute Leakages in Diffusion-based Image Editing without Training
Diffusion models have become a cornerstone in image editing, offering flexibility with
language prompts and source images. However, a key challenge is attribute leakage, where …
language prompts and source images. However, a key challenge is attribute leakage, where …
DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation
We present DiffChat, a novel method to align Large Language Models (LLMs) to" chat" with
prompt-as-input Text-to-Image Synthesis (TIS) models (eg, Stable Diffusion) for interactive …
prompt-as-input Text-to-Image Synthesis (TIS) models (eg, Stable Diffusion) for interactive …