- Academic Search

C Xu, Y Liu, J **-Once_Audio-driven_Portrait_Animation_with_Dual_Attentions_ICCV_2023_paper.pdf" data-clk="hl=fi&sa=T&oi=gga&ct=gga&cd=3&d=12958348119986055500&ei=mmK8Z7SVCJuoieoPvZLH-Qo" data-clk-atid="TKmed_VM1bMJ" target="_blank">[PDF] thecvf.com

Moda: Map**-once audio-driven portrait animation with dual attentions

Y Liu, L Lin, F Yu, C Zhou, Y Li - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Audio-driven portrait animation aims to synthesize portrait videos that are conditioned by
given audio. Animating high-fidelity and multimodal video portraits has a variety of …

Tallenna Viittaa Viittausten määrä 22 Aiheeseen liittyviä artikkeleita Kaikki 5 versiota HTML-versio

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Revisiting generalizability in deepfake detection: Improving metrics and stabilizing transfer

S Kamat, S Agarwal, T Darrell… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract" Generalizability" is seen as the hallmark quality of a good deepfake detection
model. However, standard out-of-domain evaluation datasets are very similar in form to the …

Tallenna Viittaa Viittausten määrä 8 Aiheeseen liittyviä artikkeleita Kaikki 3 versiota HTML-versio

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Dagan++: Depth-aware generative adversarial network for talking head video generation

FT Hong, L Shen, D Xu - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org

Predominant techniques on talking head generation largely depend on 2D information,
including facial appearances and motions from input face images. Nevertheless, dense 3D …

Tallenna Viittaa Viittausten määrä 10 Aiheeseen liittyviä artikkeleita Kaikki 6 versiota

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Multimodal-driven talking face generation via a unified diffusion-based generator

C Xu, S Zhu, J Zhu, T Huang, J Zhang, Y Tai… - arxiv preprint arxiv …, 2023 - arxiv.org

Multimodal-driven talking face generation refers to animating a portrait with the given pose,
expression, and gaze transferred from the driving image and video, or estimated from the …

Tallenna Viittaa Viittausten määrä 14 Aiheeseen liittyviä artikkeleita Kaikki 2 versiota HTML-versio

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Make your actor talk: Generalizable and high-fidelity lip sync with motion and appearance disentanglement

R Yu, T He, A Zhang, Y Wang, J Guo, X Tan… - arxiv preprint arxiv …, 2024 - arxiv.org

We aim to edit the lip movements in talking video according to the given speech while
preserving the personal identity and visual details. The task can be decomposed into two …

Tallenna Viittaa Viittausten määrä 5 Aiheeseen liittyviä artikkeleita Kaikki 2 versiota HTML-versio

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Dreamhead: Learning spatial-temporal correspondence via hierarchical diffusion for audio-driven talking head synthesis

FT Hong, Y Liu, Y Li, C Zhou, F Yu, D Xu - arxiv preprint arxiv:2409.10281, 2024 - arxiv.org

Audio-driven talking head synthesis strives to generate lifelike video portraits from provided
audio. The diffusion model, recognized for its superior quality and robust generalization, has …

Tallenna Viittaa Viittausten määrä 2 Aiheeseen liittyviä artikkeleita Kaikki 2 versiota HTML-versio

[Free GPT-4]
[DeepSeek]

[PDF] brosdocs.net

EAT-Face: Emotion-Controllable Audio-Driven Talking Face Generation via Diffusion Model

H Wang, X Jia, X Cao - 2024 IEEE 18th International …, 2024 - ieeexplore.ieee.org

Audio-driven talking face generation is a promising task with a lot of attention. Despite
abundant efforts are devoted to video quality and lip synchronization, most existing works do …

Tallenna Viittaa Viittausten määrä 2 Aiheeseen liittyviä artikkeleita Kaikki 3 versiota

Luo ilmoitus

Viittaa

Tarkennettu haku

Tallennettu omaan kirjastoon

Difftalk: Crafting diffusion models for generalized talking head synthesis

Facechain-imagineid: Freely crafting high-fidelity diverse talking faces from disentangled audio

Moda: Map**-once audio-driven portrait animation with dual attentions

Revisiting generalizability in deepfake detection: Improving metrics and stabilizing transfer

Dagan++: Depth-aware generative adversarial network for talking head video generation

Multimodal-driven talking face generation via a unified diffusion-based generator

Make your actor talk: Generalizable and high-fidelity lip sync with motion and appearance disentanglement

Dreamhead: Learning spatial-temporal correspondence via hierarchical diffusion for audio-driven talking head synthesis

EAT-Face: Emotion-Controllable Audio-Driven Talking Face Generation via Diffusion Model