Visualization and visual analytics approaches for image and video datasets: A survey

S Afzal, S Ghani, MM Hittawe, SF Rashid… - ACM Transactions on …, 2023 - dl.acm.org
Image and video data analysis has become an increasingly important research area with
applications in different domains such as security surveillance, healthcare, augmented and …

Sadtalker: Learning realistic 3d motion coefficients for stylized audio-driven single image talking face animation

W Zhang, X Cun, X Wang, Y Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Generating talking head videos through a face image and a piece of speech audio still
contains many challenges. ie, unnatural head movement, distorted expression, and identity …

Emo: Emote portrait alive generating expressive portrait videos with audio2video diffusion model under weak conditions

L Tian, Q Wang, B Zhang, L Bo - European Conference on Computer …, 2024 - Springer
In this work, we tackle the challenge of enhancing the realism and expressiveness in talking
head video generation by focusing on the dynamic and nuanced relationship between audio …

Styleheat: One-shot high-resolution editable talking face generation via pre-trained stylegan

F Yin, Y Zhang, X Cun, M Cao, Y Fan, X Wang… - European conference on …, 2022 - Springer
One-shot talking face generation aims at synthesizing a high-quality talking face video from
an arbitrary portrait image, driven by a video or an audio segment. In this work, we provide a …

Pirenderer: Controllable portrait image generation via semantic neural rendering

Y Ren, G Li, Y Chen, TH Li… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Generating portrait images by controlling the motions of existing faces is an important task of
great consequence to social media industries. For easy use and intuitive control …

Audio-driven emotional video portraits

X Ji, H Zhou, K Wang, W Wu, CC Loy… - Proceedings of the …, 2021 - openaccess.thecvf.com
Despite previous success in generating audio-driven talking heads, most of the previous
studies focus on the correlation between speech content and the mouth shape. Facial …

Facial: Synthesizing dynamic talking face with implicit attribute learning

C Zhang, Y Zhao, Y Huang, M Zeng… - Proceedings of the …, 2021 - openaccess.thecvf.com
In this paper, we propose a talking face generation method that takes an audio signal as
input and a short target video clip as reference, and synthesizes a photo-realistic video of …

Dfa-nerf: Personalized talking head generation via disentangled face attributes neural rendering

S Yao, RZ Zhong, Y Yan, G Zhai, X Yang - arxiv preprint arxiv:2201.00791, 2022 - arxiv.org
While recent advances in deep neural networks have made it possible to render high-quality
images, generating photo-realistic and personalized talking head remains challenging. With …

Deep image synthesis from intuitive user input: A review and perspectives

Y Xue, YC Guo, H Zhang, T Xu, SH Zhang… - Computational Visual …, 2022 - Springer
In many applications of computer graphics, art, and design, it is desirable for a user to
provide intuitive non-image input, such as text, sketch, stroke, graph, or layout, and have a …

Videoretalking: Audio-based lip synchronization for talking head video editing in the wild

K Cheng, X Cun, Y Zhang, M **a, F Yin, M Zhu… - SIGGRAPH Asia 2022 …, 2022 - dl.acm.org
We present VideoReTalking, a new system to edit the faces of a real-world talking head
video according to input audio, producing a high-quality and lip-syncing output video even …