Emotion recognition and artificial intelligence: A systematic review (2014–2023) and research recommendations

SK Khare, V Blanes-Vidal, ES Nadimi, UR Acharya - Information fusion, 2024 - Elsevier
Emotion recognition is the ability to precisely infer human emotions from numerous sources
and modalities using questionnaires, physical signals, and physiological signals. Recently …

An overview of affective speech synthesis and conversion in the deep learning era

A Triantafyllopoulos, BW Schuller… - Proceedings of the …, 2023 - ieeexplore.ieee.org
Speech is the fundamental mode of human communication, and its synthesis has long been
a core priority in human–computer interaction research. In recent years, machines have …

MakeItTalk: speaker-aware talking-head animation

Y Zhou, X Han, E Shechtman, J Echevarria… - ACM Transactions On …, 2020 - dl.acm.org
We present a method that generates expressive talking-head videos from a single facial
image with audio as the only input. In contrast to previous attempts to learn direct mappings …

Spoofing and countermeasures for speaker verification: A survey

Z Wu, N Evans, T Kinnunen, J Yamagishi, F Alegre… - speech …, 2015 - Elsevier
While biometric authentication has advanced significantly in recent years, evidence shows
the technology can be susceptible to malicious spoofing attacks. The research community …

An overview of voice conversion systems

SH Mohammadi, A Kain - Speech Communication, 2017 - Elsevier
Voice transformation (VT) aims to change one or more aspects of a speech signal while
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …

Personal voice assistant security and privacy—a survey

P Cheng, U Roedig - Proceedings of the IEEE, 2022 - ieeexplore.ieee.org
Personal voice assistants (PVAs) are increasingly used as interfaces to digital environments.
Voice commands are used to interact with phones, smart homes, or cars. In the United …

A deep learning framework for audio deepfake detection

J Khochare, C Joshi, B Yenarkar, S Suratkar… - Arabian Journal for …, 2021 - Springer
Audio deepfakes have been increasingly emerging as a potential source of deceit, with the
development of avant-garde methods of synthetic speech generation. Hence, differentiating …

Identity and content authentication for phone calls

PG Traynor, BG Reaves, LE Blue, L Vargas… - US Patent …, 2020 - Google Patents
H04L 9/32 (2006.01) H04W 12/04 (2009.01) H04L 29/06 (2006.01) H04W 12/06 (2009.01)
H04W 12/00 (2009.01) H04W 12/10 (2009.01) (52) U.S. Cl. H04L 65/1076 (2013.01); …

De-identification for privacy protection in multimedia content: A survey

S Ribaric, A Ariyaeeinia, N Pavesic - Signal Processing: Image …, 2016 - Elsevier
Privacy is one of the most important social and political issues in our information society,
characterized by a growing range of enabling and supporting technologies and services …

Deepfakes as a threat to a speaker and facial recognition: An overview of tools and attack vectors

A Firc, K Malinka, P Hanáček - Heliyon, 2023 - cell.com
Deepfakes present an emerging threat in cyberspace. Recent developments in machine
learning make deepfakes highly believable, and very difficult to differentiate between what is …