Diffusion model-based image editing: A survey
Denoising diffusion models have emerged as a powerful tool for various image generation
and editing tasks, facilitating the synthesis of visual content in an unconditional or input …
Instruction-Guided Editing Controls for Images and Multimedia: A Survey in LLM era
The rapid advancement of large language models (LLMs) and multimodal learning has
transformed digital content creation and manipulation. Traditional visual editing tools require …
DiffusionAct: Controllable Diffusion Autoencoder for One-shot Face Reenactment
Video-driven neural face reenactment aims to synthesize realistic facial images that
successfully preserve the identity and appearance of a source face, while transferring the …
Unsupervised discovery of interpretable directions in h-space of pre-trained diffusion models
We propose the first unsupervised and learning-based method to identify interpretable
directions in h-space of pre-trained diffusion models. Our method is derived from an existing …
Joint Learning of Depth and Appearance for Portrait Image Animation
2D portrait animation has experienced significant advancements in recent years. Much
research has utilized the prior knowledge embedded in large generative diffusion models to …
Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering
We propose GauFace, a novel Gaussian Splatting representation, tailored for efficient
animation and rendering of physically-based facial assets. Leveraging strong geometric …
CoIE: Chain-of-instruct editing for multi-attribute face manipulation
Current text-to-image editing models often encounter challenges with smoothly manipulating
multiple attributes using a single instruction. Taking inspiration from the Chain-of-Thought …
ICE: Interactive 3D Game Character Editing via Dialogue
Most recent popular Role-Playing Games (RPGs) allow players to create in-game characters
with hundreds of adjustable parameters, including bone positions and various makeup …
Unsupervised learning with diffusion models
J Wang - 2023 - dr.ntu.edu.sg
In computer vision, a key goal is to obtain visual representations that faithfully capture the
underlying structure and semantics of the data, encompassing object identities, positions …