Audio deepfake detection: A survey

J Yi, C Wang, J Tao, X Zhang, CY Zhang… - arxiv preprint arxiv …, 2023‏ - arxiv.org
Audio deepfake detection is an emerging active topic. A growing number of literatures have
aimed to study deepfake detection algorithms and achieved effective performance, the …

Battling voice spoofing: a review, comparative analysis, and generalizability evaluation of state-of-the-art voice spoofing counter measures

A Khan, KM Malik, J Ryan, M Saravanan - Artificial Intelligence Review, 2023‏ - Springer
With the advent of automated speaker verification (ASV) systems comes an equal and
opposite development: malicious actors may seek to use voice spoofing attacks to fool those …

Asvspoof 2021: Towards spoofed and deepfake speech detection in the wild

X Liu, X Wang, M Sahidullah, J Patino… - … on Audio, Speech …, 2023‏ - ieeexplore.ieee.org
Benchmarking initiatives support the meaningful comparison of competing solutions to
prominent problems in speech and language processing. Successive benchmarking …

Avoid-df: Audio-visual joint learning for detecting deepfake

W Yang, X Zhou, Z Chen, B Guo, Z Ba… - IEEE Transactions …, 2023‏ - ieeexplore.ieee.org
Recently, deepfakes have raised severe concerns about the authenticity of online media.
Prior works for deepfake detection have made many efforts to capture the intra-modal …

Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation

H Tak, M Todisco, X Wang, J Jung, J Yamagishi… - arxiv preprint arxiv …, 2022‏ - arxiv.org
The performance of spoofing countermeasure systems depends fundamentally upon the use
of sufficiently representative training data. With this usually being limited, current solutions …

Uniaudio: Towards universal audio generation with large language models

D Yang, J Tian, X Tan, R Huang, S Liu… - … on Machine Learning, 2024‏ - openreview.net
Audio generation is a major branch of generative AI research. Compared with prior works in
this area that are commonly task-specific with heavy domain knowledge, this paper …

ASVspoof 5: Crowdsourced speech data, deepfakes, and adversarial attacks at scale

X Wang, H Delgado, H Tak, J Jung, H Shim… - arxiv preprint arxiv …, 2024‏ - arxiv.org
ASVspoof 5 is the fifth edition in a series of challenges that promote the study of speech
spoofing and deepfake attacks, and the design of detection solutions. Compared to previous …

Mlaad: The multi-language audio anti-spoofing dataset

NM Müller, P Kawa, WH Choong… - … Joint Conference on …, 2024‏ - ieeexplore.ieee.org
Text-to-Speech (TTS) technology brings significant advantages, such as giving a voice to
those with speech impairments, but also enables audio deepfakes and spoofs. The former …

Generation and detection of manipulated multimodal audiovisual content: Advances, trends and open challenges

H Liz-Lopez, M Keita, A Taleb-Ahmed, A Hadid… - Information …, 2024‏ - Elsevier
Generative deep learning techniques have invaded the public discourse recently. Despite
the advantages, the applications to disinformation are concerning as the counter-measures …

SASV 2022: The first spoofing-aware speaker verification challenge

J Jung, H Tak, H Shim, HS Heo, BJ Lee… - arxiv preprint arxiv …, 2022‏ - arxiv.org
The first spoofing-aware speaker verification (SASV) challenge aims to integrate research
efforts in speaker verification and anti-spoofing. We extend the speaker verification scenario …