- Academic Search

Multimodal image synthesis and editing: A survey and taxonomy

F Zhan, Y Yu, R Wu, J Zhang, S Lu, L Liu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org

As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …

Cited by 266, all 11 versions

[PDF] springer.com

Generative artificial intelligence: a systematic review and applications

SS Sengar, AB Hasan, S Kumar, F Carroll - Multimedia Tools and …, 2024 - Springer

In recent years, the study of artificial intelligence (AI) has undergone a paradigm shift. This
has been propelled by the groundbreaking capabilities of generative models both in …

Cited by 38

[PDF] thecvf.com

SadTalker: Learning realistic 3D motion coefficients for stylized audio-driven single image talking face animation

W Zhang, X Cun, X Wang, Y Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Generating talking head videos through a face image and a piece of speech audio still
contains many challenges, i.e., unnatural head movement, distorted expression, and identity …

Cited by 242, all 7 versions

[PDF] arxiv.org

EMO: Emote Portrait Alive - generating expressive portrait videos with Audio2Video diffusion model under weak conditions

L Tian, Q Wang, B Zhang, L Bo - European Conference on Computer …, 2024 - Springer

In this work, we tackle the challenge of enhancing the realism and expressiveness in talking
head video generation by focusing on the dynamic and nuanced relationship between audio …

Cited by 90, all 2 versions

[PDF] thecvf.com

Generating holistic 3D human motion from speech

H Yi, H Liang, Y Liu, Q Cao, Y Wen… - Proceedings of the …, 2023 - openaccess.thecvf.com

This work addresses the problem of generating 3D holistic body motions from human
speech. Given a speech recording, we synthesize sequences of 3D body poses, hand …

Cited by 131, all 7 versions

[PDF] thecvf.com

Pose-controllable talking face generation by implicitly modularized audio-visual representation

H Zhou, Y Sun, W Wu, CC Loy… - Proceedings of the …, 2021 - openaccess.thecvf.com

While accurate lip synchronization has been achieved for arbitrary-subject audio-driven
talking face generation, the problem of how to efficiently drive the head pose remains …

Cited by 396, all 9 versions

[PDF] thecvf.com

Expressive talking head generation with granular audio-visual control

B Liang, Y Pan, Z Guo, H Zhou… - Proceedings of the …, 2022 - openaccess.thecvf.com

Generating expressive talking heads is essential for creating virtual humans. However,
existing one- or few-shot methods focus on lip-sync and head motion, ignoring the emotional …

Cited by 134, all 4 versions

[PDF] thecvf.com

Flow-guided one-shot talking face generation with a high-resolution audio-visual dataset

Z Zhang, L Li, Y Ding, C Fan - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com

One-shot talking face generation should synthesize high visual quality facial videos with
reasonable animations of expression and head pose, and just utilize arbitrary driving audio …

Cited by 314, all 5 versions

[PDF] arxiv.org

StyleHEAT: One-shot high-resolution editable talking face generation via pre-trained StyleGAN

F Yin, Y Zhang, X Cun, M Cao, Y Fan, X Wang… - European conference on …, 2022 - Springer

One-shot talking face generation aims at synthesizing a high-quality talking face video from
an arbitrary portrait image, driven by a video or an audio segment. In this work, we provide a …

Cited by 168, all 6 versions

[PDF] thecvf.com

Diffused Heads: Diffusion models beat GANs on talking-face generation

M Stypułkowski, K Vougioukas, S He… - Proceedings of the …, 2024 - openaccess.thecvf.com

Talking face generation has historically struggled to produce head movements and natural
facial expressions without guidance from additional reference videos. Recent developments …

Cited by 128, all 6 versions
