Audio deepfake detection: A survey
Audio deepfake detection is an emerging active topic. A growing number of literatures have
aimed to study deepfake detection algorithms and achieved effective performance, the …
aimed to study deepfake detection algorithms and achieved effective performance, the …
ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection
ASVspoof 2021 is the forth edition in the series of biannual challenges which aim to promote
the study of spoofing and the design of countermeasures to protect automatic speaker …
the study of spoofing and the design of countermeasures to protect automatic speaker …
Aasist: Audio anti-spoofing using integrated spectro-temporal graph attention networks
Artefacts that differentiate spoofed from bona-fide utterances can reside in specific temporal
or spectral intervals. Their reliable detection usually depends upon computationally …
or spectral intervals. Their reliable detection usually depends upon computationally …
Asvspoof 2021: Towards spoofed and deepfake speech detection in the wild
Benchmarking initiatives support the meaningful comparison of competing solutions to
prominent problems in speech and language processing. Successive benchmarking …
prominent problems in speech and language processing. Successive benchmarking …
A comparative study on recent neural spoofing countermeasures for synthetic speech detection
A great deal of recent research effort on speech spoofing countermeasures has been
invested into back-end neural networks and training criteria. We contribute to this effort with …
invested into back-end neural networks and training criteria. We contribute to this effort with …
End-to-end spectro-temporal graph attention networks for speaker verification anti-spoofing and speech deepfake detection
Artefacts that serve to distinguish bona fide speech from spoofed or deepfake speech are
known to reside in specific subbands and temporal segments. Various approaches can be …
known to reside in specific subbands and temporal segments. Various approaches can be …
Does audio deepfake detection generalize?
Current text-to-speech algorithms produce realistic fakes of human voices, making deepfake
detection a much-needed area of research. While researchers have presented various …
detection a much-needed area of research. While researchers have presented various …
Towards end-to-end synthetic speech detection
The constant Q transform (CQT) has been shown to be one of the most effective speech
signal pre-transforms to facilitate synthetic speech detection, followed by either hand-crafted …
signal pre-transforms to facilitate synthetic speech detection, followed by either hand-crafted …
Investigating self-supervised front ends for speech spoofing countermeasures
Self-supervised speech model is a rapid progressing research topic, and many pre-trained
models have been released and used in various down stream tasks. For speech anti …
models have been released and used in various down stream tasks. For speech anti …
A survey on the detection and impacts of deepfakes in visual, audio, and textual formats
In the rapidly evolving digital landscape, the generation of fake visual, audio, and textual
content poses a significant threat to the trust of society, political stability, and integrity of …
content poses a significant threat to the trust of society, political stability, and integrity of …