Diffusion model-based image editing: A survey

Y Huang, J Huang, Y Liu, M Yan, J Lv, J Liu… - arxiv preprint arxiv …, 2024 - arxiv.org
Denoising diffusion models have emerged as a powerful tool for various image generation
and editing tasks, facilitating the synthesis of visual content in an unconditional or input …

Instruction-Guided Editing Controls for Images and Multimedia: A Survey in LLM era

TT Nguyen, Z Ren, T Pham, PL Nguyen, H Yin… - arxiv preprint arxiv …, 2024 - arxiv.org
The rapid advancement of large language models (LLMs) and multimodal learning has
transformed digital content creation and manipulation. Traditional visual editing tools require …

DiffusionAct: Controllable Diffusion Autoencoder for One-shot Face Reenactment

S Bounareli, C Tzelepis, V Argyriou, I Patras… - arxiv preprint arxiv …, 2024 - arxiv.org
Video-driven neural face reenactment aims to synthesize realistic facial images that
successfully preserve the identity and appearance of a source face, while transferring the …

Unsupervised discovery of interpretable directions in h-space of pre-trained diffusion models

Z Zhang, L Liu, Z Lin, Y Zhu, Z Zhao - arxiv preprint arxiv:2310.09912, 2023 - arxiv.org
We propose the first unsupervised and learning-based method to identify interpretable
directions in h-space of pre-trained diffusion models. Our method is derived from an existing …

Joint Learning of Depth and Appearance for Portrait Image Animation

X Ji, G Zoss, P Chandran, L Yang, X Cao… - arxiv preprint arxiv …, 2025 - arxiv.org
2D portrait animation has experienced significant advancements in recent years. Much
research has utilized the prior knowledge embedded in large generative diffusion models to …

Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering

D Qin, H Lin, Q Zhang, K Qiao, L Zhang, Z Zhao… - arxiv preprint arxiv …, 2024 - arxiv.org
We propose GauFace, a novel Gaussian Splatting representation, tailored for efficient
animation and rendering of physically-based facial assets. Leveraging strong geometric …

Coie: Chain-of-instruct editing for multi-attribute face manipulation

Z Zhang, BW Zhang, G Liu - arxiv preprint arxiv:2312.07879, 2023 - arxiv.org
Current text-to-image editing models often encounter challenges with smoothly manipulating
multiple attributes using a single instruction. Taking inspiration from the Chain-of-Thought …

ICE: Interactive 3D Game Character Editing via Dialogue

H Wu, M Zhao, Z Hu, L Li, W Chen, R Zhao… - arxiv preprint arxiv …, 2024 - arxiv.org
ost recent popular Role-Playing Games (RPGs) allow players to create in-game characters
with hundreds of adjustable parameters, including bone positions and various makeup …

Unsupervised learning with diffusion models

J Wang - 2023 - dr.ntu.edu.sg
In computer vision, a key goal is to obtain visual representations that faithfully capture the
underlying structure and semantics of the data, encompassing object identities, positions …

[CITATION][C] CoralStyleCLIP: Region and Layer Optimization for Image Editing

Z Huma, H Azmat - Engineering Frontier Studies, 2024