Audio-Driven Facial Animation with Deep Learning: A Survey

D Jiang, J Chang, L You, S Bian, R Kosk, G Maguire - Information, 2024 - mdpi.com
Audio-driven facial animation is a rapidly evolving field that aims to generate realistic facial
expressions and lip movements synchronized with a given audio input. This survey provides …

Human motion video generation: A survey

H Xue, X Luo, Z Hu, X Zhang, X Xiang, Y Dai, J Liu… - Authorea …, 2024 - techrxiv.org
Human motion video generation has garnered significant research interest due to its broad
applications, enabling innovations such as photorealistic singing heads or dynamic avatars …

Sonic: Shifting Focus to Global Audio Perception in Portrait Animation

X Ji, X Hu, Z Xu, J Zhu, C Lin, Q He, J Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
The study of talking face generation mainly explores the intricacies of synchronizing facial
movements and crafting visually appealing, temporally-coherent animations. However, due …

Comparative Analysis of Audio Feature Extraction for Real-Time Talking Portrait Synthesis

P Salehi, SA Sheshkal, V Thambawita… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper examines the integration of real-time talking-head generation for interviewer
training, focusing on overcoming challenges in Audio Feature Extraction (AFE), which often …

Orator: LLM-Guided Multi-Shot Speech Video Generation

J Chen, Y Fu, A Zeng, Z Wang, S Cen, X Yu, J Tanke… - openreview.net
In this work, we propose a novel system for automatically generating multi-shot speech
videos with natural camera transitions, using input text lines and reference images from …