Audio deepfake detection: A survey

J Yi, C Wang, J Tao, X Zhang, CY Zhang… - arxiv preprint arxiv …, 2023 - arxiv.org
Audio deepfake detection is an emerging active topic. A growing number of literatures have
aimed to study deepfake detection algorithms and achieved effective performance, the …

ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection

J Yamagishi, X Wang, M Todisco… - ASVspoof 2021 …, 2021 - inria.hal.science
ASVspoof 2021 is the forth edition in the series of biannual challenges which aim to promote
the study of spoofing and the design of countermeasures to protect automatic speaker …

Aasist: Audio anti-spoofing using integrated spectro-temporal graph attention networks

J Jung, HS Heo, H Tak, H Shim… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Artefacts that differentiate spoofed from bona-fide utterances can reside in specific temporal
or spectral intervals. Their reliable detection usually depends upon computationally …

Asvspoof 2021: Towards spoofed and deepfake speech detection in the wild

X Liu, X Wang, M Sahidullah, J Patino… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org
Benchmarking initiatives support the meaningful comparison of competing solutions to
prominent problems in speech and language processing. Successive benchmarking …

A comparative study on recent neural spoofing countermeasures for synthetic speech detection

X Wang, J Yamagishi - arxiv preprint arxiv:2103.11326, 2021 - arxiv.org
A great deal of recent research effort on speech spoofing countermeasures has been
invested into back-end neural networks and training criteria. We contribute to this effort with …

End-to-end spectro-temporal graph attention networks for speaker verification anti-spoofing and speech deepfake detection

H Tak, J Jung, J Patino, M Kamble, M Todisco… - arxiv preprint arxiv …, 2021 - arxiv.org
Artefacts that serve to distinguish bona fide speech from spoofed or deepfake speech are
known to reside in specific subbands and temporal segments. Various approaches can be …

Does audio deepfake detection generalize?

NM Müller, P Czempin, F Dieckmann… - arxiv preprint arxiv …, 2022 - arxiv.org
Current text-to-speech algorithms produce realistic fakes of human voices, making deepfake
detection a much-needed area of research. While researchers have presented various …

Towards end-to-end synthetic speech detection

G Hua, ABJ Teoh, H Zhang - IEEE Signal Processing Letters, 2021 - ieeexplore.ieee.org
The constant Q transform (CQT) has been shown to be one of the most effective speech
signal pre-transforms to facilitate synthetic speech detection, followed by either hand-crafted …

Investigating self-supervised front ends for speech spoofing countermeasures

X Wang, J Yamagishi - arxiv preprint arxiv:2111.07725, 2021 - arxiv.org
Self-supervised speech model is a rapid progressing research topic, and many pre-trained
models have been released and used in various down stream tasks. For speech anti …

A survey on the detection and impacts of deepfakes in visual, audio, and textual formats

R Mubarak, T Alsboui, O Alshaikh, I Inuwa-Dutse… - Ieee …, 2023 - ieeexplore.ieee.org
In the rapidly evolving digital landscape, the generation of fake visual, audio, and textual
content poses a significant threat to the trust of society, political stability, and integrity of …