Identity-preserving talking face generation with landmark and appearance priors

W Zhong, C Fang, Y Cai, P Wei… - Proceedings of the …, 2023 - openaccess.thecvf.com
Generating talking face videos from audio attracts lots of research interest. A few person-
specific methods can generate vivid videos but require the target speaker's videos for …

Efficient emotional adaptation for audio-driven talking-head generation

Y Gan, Z Yang, X Yue, L Sun… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Audio-driven talking-head synthesis is a popular research topic for virtual human-related
applications. However, the inflexibility and inefficiency of existing methods, which …

Deepfake generation and detection: A benchmark and survey

G Pei, J Zhang, M Hu, Z Zhang, C Wang, Y Wu… - arxiv preprint arxiv …, 2024 - arxiv.org
Deepfake is a technology dedicated to creating highly realistic facial images and videos
under specific conditions, which has significant application potential in fields such as …

Robust one-shot face video re-enactment using hybrid latent spaces of stylegan2

T Oorloff, Y Yacoob - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Recent research on one-shot face re-enactment has progressively overcome the low-
resolution constraint with the help of StyleGAN's high-fidelity portrait generation. However …

Interactive conversational head generation

M Zhou, Y Bai, W Zhang, T Yao, T Zhao - arxiv preprint arxiv:2307.02090, 2023 - arxiv.org
We introduce a new conversation head generation benchmark for synthesizing behaviors of
a single interlocutor in a face-to-face conversation. The capability to automatically …

A Unified Approach for Occlusion Tolerant 3D Facial Pose Capture and Gaze Estimation Using MocapNETs

A Qammaz, AA Argyros - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
We tackle the challenging problems of 3D facial capture, head pose and gaze estimation.
We do so by extending MocapNET, a highly effective deep learning motion capture …

Fiancee: Faster inference of adversarial networks via conditional early exits

P Karpikova, E Radionova… - Proceedings of the …, 2023 - openaccess.thecvf.com
Generative DNNs are a powerful tool for image synthesis, but they are limited by their
computational load. On the other hand, given a trained model and a task, eg faces …

A survey on deep learning based reenactment methods for deepfake applications

R Dhanyalakshmi, CI Popirlan… - IET Image …, 2024 - Wiley Online Library
Among the sectors that deep learning has transformed, deepfake, a novel method of
manipulating multimedia, deserves particular attention. The long‐term objective of many …

3D Video Conferencing via On-hand Devices

Y **, X Duan, K Hu, F Wang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Video conferencing has become indispensable in human communication. Researchers are
exploring immersive capabilities to enhance video conferencing experiences by delivering …

Cascaded learning with transformer for simultaneous eye landmark, eye state and gaze estimation

C Gou, Y Yu, Z Guo, C **ong, M Cai - Pattern Recognition, 2024 - Elsevier
Eye tracking have garnered attention in human–machine interaction, disease monitoring,
biometrics, etc. Existing investigations for eye tracking have predominantly concentrated on …