Deep facial expression recognition: A survey

S Li, W Deng - IEEE transactions on affective computing, 2020 - ieeexplore.ieee.org
With the transition of facial expression recognition (FER) from laboratory-controlled to
challenging in-the-wild conditions and the recent success of deep learning techniques in …

A survey on face data augmentation for the training of deep neural networks

X Wang, K Wang, S Lian - Neural computing and applications, 2020 - Springer
The quality and size of training set have a great impact on the results of deep learning-
based face-related tasks. However, collecting and labeling adequate samples with high …

Audio-driven emotional video portraits

X Ji, H Zhou, K Wang, W Wu, CC Loy… - Proceedings of the …, 2021 - openaccess.thecvf.com
Despite previous success in generating audio-driven talking heads, most of the previous
studies focus on the correlation between speech content and the mouth shape. Facial …

Mead: A large-scale audio-visual dataset for emotional talking-face generation

K Wang, Q Wu, L Song, Z Yang, W Wu, C Qian… - … on Computer Vision, 2020 - Springer
The synthesis of natural emotional reactions is an essential criterion in vivid talking-face
video generation. This criterion is nevertheless seldom taken into consideration in previous …

Hologan: Unsupervised learning of 3d representations from natural images

T Nguyen-Phuoc, C Li, L Theis… - Proceedings of the …, 2019 - openaccess.thecvf.com
We propose a novel generative adversarial network (GAN) for the task of unsupervised
learning of 3D representations from natural images. Most generative models rely on 2D …

Fakecatcher: Detection of synthetic portrait videos using biological signals

UA Ciftci, I Demir, L Yin - IEEE transactions on pattern analysis …, 2020 - ieeexplore.ieee.org
The recent proliferation of fake portrait videos poses direct threats on society, law, and
privacy [1]. Believing the fake video of a politician, distributing fake pornographic content of …

X2face: A network for controlling face generation using images, audio, and pose codes

O Wiles, A Koepke, A Zisserman - Proceedings of the …, 2018 - openaccess.thecvf.com
The objective of this paper is a neural network model that controls the pose and expression
of a given face, using another face or modality (eg audio). This model can then be used for …

High-fidelity generalized emotional talking face generation with multi-modal emotion space learning

C Xu, J Zhu, J Zhang, Y Han, W Chu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recently, emotional talking face generation has received considerable attention. However,
existing methods only adopt one-hot coding, image, or audio as emotion conditions, thus …

Countering malicious deepfakes: Survey, battleground, and horizon

F Juefei-Xu, R Wang, Y Huang, Q Guo, L Ma… - International journal of …, 2022 - Springer
The creation or manipulation of facial appearance through deep generative approaches,
known as DeepFake, have achieved significant progress and promoted a wide range of …

Mixaugment & mixup: Augmentation methods for facial expression recognition

A Psaroudakis, D Kollias - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com
Abstract Automatic Facial Expression Recognition (FER) has attracted increasing attention
in the last 20 years since facial expressions play a central role in human communication …