Aasist: Audio anti-spoofing using integrated spectro-temporal graph attention networks

J Jung, HS Heo, H Tak, H Shim… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Artefacts that differentiate spoofed from bona-fide utterances can reside in specific temporal
or spectral intervals. Their reliable detection usually depends upon computationally …

End-to-end spectro-temporal graph attention networks for speaker verification anti-spoofing and speech deepfake detection

H Tak, J Jung, J Patino, M Kamble, M Todisco… - arxiv preprint arxiv …, 2021 - arxiv.org
Artefacts that serve to distinguish bona fide speech from spoofed or deepfake speech are
known to reside in specific subbands and temporal segments. Various approaches can be …

[PDF][PDF] The effect of silence and dual-band fusion in anti-spoofing system

Y Zhang12, W Wang12, P Zhang12 - Proc. Interspeech, 2021 - isca-archive.org
The current neural network based anti-spoofing systems have poor robustness. Their
performance degrades further after voice activity detection (VAD) performed, making it …

[PDF][PDF] ResNet and Model Fusion for Automatic Spoofing Detection.

Z Chen, Z **e, W Zhang, X Xu - Interspeech, 2017 - isca-archive.org
Speaker verification systems have achieved great progress in recent years. Unfortunately,
they are still highly prone to different kinds of spoofing attacks such as speech synthesis …

Graph attention networks for anti-spoofing

H Tak, J Jung, J Patino, M Todisco, N Evans - arxiv preprint arxiv …, 2021 - arxiv.org
The cues needed to detect spoofing attacks against automatic speaker verification are often
located in specific spectral sub-bands or temporal segments. Previous works show the …

Spoofing attack detection using the non-linear fusion of sub-band classifiers

H Tak, J Patino, A Nautsch, N Evans… - arxiv preprint arxiv …, 2020 - arxiv.org
The threat of spoofing can pose a risk to the reliability of automatic speaker verification.
Results from the bi-annual ASVspoof evaluations show that effective countermeasures …

A study on data augmentation in voice anti-spoofing

A Cohen, I Rimon, E Aflalo, HH Permuter - Speech Communication, 2022 - Elsevier
In this paper we perform an in depth study of how data augmentation techniques improve
synthetic or spoofed audio detection. Specifically, we propose methods to deal with channel …

Significance of subband features for synthetic speech detection

J Yang, RK Das, H Li - IEEE Transactions on Information …, 2019 - ieeexplore.ieee.org
In text-to-speech or voice conversion based synthetic speech detection, it is a common
practice that spectral information over the entire frequency band is used for feature …

A Survey on Speech Deepfake Detection

M Li, Y Ahmadiadli, XP Zhang - ACM Computing Surveys, 2025 - dl.acm.org
The availability of smart devices leads to an exponential increase in multimedia content.
However, advancements in deep learning have also enabled the creation of highly …

Spectral features for synthetic speech detection

D Paul, M Pal, G Saha - IEEE journal of selected topics in signal …, 2017 - ieeexplore.ieee.org
Recent advancements in voice conversion (VC) and speech synthesis research make
speech-based biometric systems highly prone to spoofing attacks. This can provoke an …