DeepFake on face and expression swap: A review

S Waseem, SARSA Bakar, BA Ahmed, Z Omar… - IEEE …, 2023 - ieeexplore.ieee.org
Remarkable advances have been made in deep learning, leading to the emergence of
highly realistic AI-generated videos known as deepfakes. Deepfakes use generative models …

Video and audio deepfake datasets and open issues in deepfake technology: being ahead of the curve

Z Akhtar, TL Pendyala, VS Athmakuri - Forensic Sciences, 2024 - mdpi.com
The revolutionary breakthroughs in Machine Learning (ML) and Artificial Intelligence (AI) are
extensively being harnessed across a diverse range of domains, e.g., forensic science …

SafeEar: Content privacy-preserving audio deepfake detection

X Li, K Li, Y Zheng, C Yan, X Ji, W Xu - Proceedings of the 2024 on ACM …, 2024 - dl.acm.org
Text-to-Speech (TTS) and Voice Conversion (VC) models have exhibited remarkable
performance in generating realistic and natural audio. However, their dark side, audio …

Fake artificial intelligence generated contents (FAIGC): a survey of theories, detection methods, and opportunities

X Yu, Y Wang, Y Chen, Z Tao, D **, S Song… - arXiv preprint arXiv …, 2024 - arxiv.org
In recent years, generative artificial intelligence models, represented by Large Language
Models (LLMs) and Diffusion Models (DMs), have revolutionized content production …

Audio deepfake detection with self-supervised XLS-R and SLS classifier

Q Zhang, S Wen, T Hu - Proceedings of the 32nd ACM International …, 2024 - dl.acm.org
Generative AI technologies, including text-to-speech (TTS) and voice conversion (VC),
frequently become indistinguishable from genuine samples, posing challenges for …

Audio–visual deepfake detection using articulatory representation learning

Y Wang, H Huang - Computer Vision and Image Understanding, 2024 - Elsevier
Advancements in generative artificial intelligence have made it easier to manipulate auditory
and visual elements, highlighting the critical need for robust audio–visual deepfake …

SLIM: Style-linguistics mismatch model for generalized audio deepfake detection

Y Zhu, S Koppisetti, T Tran, G Bharaj - arXiv preprint arXiv:2407.18517, 2024 - arxiv.org
Audio deepfake detection (ADD) is crucial to combat the misuse of speech synthesized from
generative AI models. Existing ADD models suffer from generalization issues, with a large …

SpMis: An investigation of synthetic spoken misinformation detection

P Liu, L Wang, R He, H He, L Wang… - 2024 IEEE Spoken …, 2024 - ieeexplore.ieee.org
In recent years, speech generation technology has advanced rapidly, fueled by generative
models and large-scale training techniques. While these developments have enabled the …

From Audio Deepfake Detection to AI-Generated Music Detection--A Pathway and Overview

Y Li, M Milling, L Specia, BW Schuller - arXiv preprint arXiv:2412.00571, 2024 - arxiv.org
As Artificial Intelligence (AI) technologies continue to evolve, their use in generating realistic,
contextually appropriate content has expanded into various domains. Music, an art form and …

Towards generalisable and calibrated audio deepfake detection with self-supervised representations

O Pascu, A Stan, D Oneata, E Oneata, H Cucu - Interspeech, 2024 - isca-archive.org
Generalisation—the ability of a model to perform well on unseen data—is crucial for building
reliable deepfake detectors. However, recent studies have shown that the current audio …