Multimodal learning with transformers: A survey

P Xu, X Zhu, DA Clifton - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Transformer is a promising neural network learner, and has achieved great success in
various machine learning tasks. Thanks to the recent prevalence of multimodal applications …

TEMOS: Generating Diverse Human Motions from Textual Descriptions

M Petrovich, MJ Black, G Varol - European Conference on Computer …, 2022 - Springer
We address the problem of generating diverse 3D human motions from textual descriptions.
This challenging task requires joint modeling of both modalities: understanding and …

SadTalker: Learning realistic 3D motion coefficients for stylized audio-driven single image talking face animation

W Zhang, X Cun, X Wang, Y Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Generating talking head videos through a face image and a piece of speech audio still
contains many challenges, i.e., unnatural head movement, distorted expression, and identity …

Human-computer interaction system: A survey of talking-head generation

R Zhen, W Song, Q He, J Cao, L Shi, J Luo - Electronics, 2023 - mdpi.com
Virtual human is widely employed in various industries, including personal assistance,
intelligent customer service, and online education, thanks to the rapid development of …

EMO: Emote portrait alive - generating expressive portrait videos with audio2video diffusion model under weak conditions

L Tian, Q Wang, B Zhang, L Bo - European Conference on Computer …, 2024 - Springer
In this work, we tackle the challenge of enhancing the realism and expressiveness in talking
head video generation by focusing on the dynamic and nuanced relationship between audio …

Generating holistic 3D human motion from speech

H Yi, H Liang, Y Liu, Q Cao, Y Wen… - Proceedings of the …, 2023 - openaccess.thecvf.com
This work addresses the problem of generating 3D holistic body motions from human
speech. Given a speech recording, we synthesize sequences of 3D body poses, hand …

CodeTalker: Speech-driven 3D facial animation with discrete motion prior

J Xing, M Xia, Y Zhang, X Cun… - Proceedings of the …, 2023 - openaccess.thecvf.com
Speech-driven 3D facial animation has been widely studied, yet there is still a gap to
achieving realism and vividness due to the highly ill-posed nature and scarcity of audio …

Deep learning for visual speech analysis: A survey

C Sheng, G Kuang, L Bai, C Hou, Y Guo… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Visual speech, referring to the visual domain of speech, has attracted increasing attention
due to its wide applications, such as public security, medical treatment, military defense, and …

EmoTalk: Speech-driven emotional disentanglement for 3D face animation

Z Peng, H Wu, Z Song, H Xu, X Zhu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Speech-driven 3D face animation aims to generate realistic facial expressions that match
the speech content and emotion. However, existing methods often neglect emotional facial …

Seeing what you said: Talking face generation guided by a lip reading expert

J Wang, X Qian, M Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Talking face generation, also known as speech-to-lip generation, reconstructs facial motions
concerning lips given coherent speech input. The previous studies revealed the importance …