- Academic Search

Y Huang, J Huang, Y Liu, M Yan, J Lv, J Liu… - arxiv preprint arxiv …, 2024 - arxiv.org

Denoising diffusion models have emerged as a powerful tool for various image generation
and editing tasks, facilitating the synthesis of visual content in an unconditional or input …

Enregistrer Citer Cité 68 fois Autres articles Les 2 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Instruction-Guided Editing Controls for Images and Multimedia: A Survey in LLM era

TT Nguyen, Z Ren, T Pham, PL Nguyen, H Yin… - arxiv preprint arxiv …, 2024 - arxiv.org

The rapid advancement of large language models (LLMs) and multimodal learning has
transformed digital content creation and manipulation. Traditional visual editing tools require …

Enregistrer Citer Cité 1 fois Autres articles Les 2 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

DiffusionAct: Controllable Diffusion Autoencoder for One-shot Face Reenactment

S Bounareli, C Tzelepis, V Argyriou, I Patras… - arxiv preprint arxiv …, 2024 - arxiv.org

Video-driven neural face reenactment aims to synthesize realistic facial images that
successfully preserve the identity and appearance of a source face, while transferring the …

Enregistrer Citer Cité 4 fois Autres articles Les 2 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Unsupervised discovery of interpretable directions in h-space of pre-trained diffusion models

Z Zhang, L Liu, Z Lin, Y Zhu, Z Zhao - arxiv preprint arxiv:2310.09912, 2023 - arxiv.org

We propose the first unsupervised and learning-based method to identify interpretable
directions in h-space of pre-trained diffusion models. Our method is derived from an existing …

Enregistrer Citer Cité 3 fois Autres articles Les 2 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Joint Learning of Depth and Appearance for Portrait Image Animation

X Ji, G Zoss, P Chandran, L Yang, X Cao… - arxiv preprint arxiv …, 2025 - arxiv.org

2D portrait animation has experienced significant advancements in recent years. Much
research has utilized the prior knowledge embedded in large generative diffusion models to …

Enregistrer Citer Autres articles Les 2 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering

D Qin, H Lin, Q Zhang, K Qiao, L Zhang, Z Zhao… - arxiv preprint arxiv …, 2024 - arxiv.org

We propose GauFace, a novel Gaussian Splatting representation, tailored for efficient
animation and rendering of physically-based facial assets. Leveraging strong geometric …

Enregistrer Citer Autres articles Les 3 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Coie: Chain-of-instruct editing for multi-attribute face manipulation

Z Zhang, BW Zhang, G Liu - arxiv preprint arxiv:2312.07879, 2023 - arxiv.org

Current text-to-image editing models often encounter challenges with smoothly manipulating
multiple attributes using a single instruction. Taking inspiration from the Chain-of-Thought …

Enregistrer Citer Autres articles Les 2 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

ICE: Interactive 3D Game Character Editing via Dialogue

H Wu, M Zhao, Z Hu, L Li, W Chen, R Zhao… - arxiv preprint arxiv …, 2024 - arxiv.org

ost recent popular Role-Playing Games (RPGs) allow players to create in-game characters
with hundreds of adjustable parameters, including bone positions and various makeup …

Enregistrer Citer Autres articles Les 2 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] ntu.edu.sg

Unsupervised learning with diffusion models

J Wang - 2023 - dr.ntu.edu.sg

In computer vision, a key goal is to obtain visual representations that faithfully capture the
underlying structure and semantics of the data, encompassing object identities, positions …

Enregistrer Citer Autres articles Les 2 versions Free GPT-4 Version HTML