Rawboost: A raw data boosting and augmentation method applied to automatic speaker verification anti-spoofing

H Tak, M Kamble, J Patino, M Todisco… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
This paper introduces RawBoost, a data boosting and augmentation method for the design
of more reliable spoofing detection solutions which operate directly upon raw waveform …

[PDF][PDF] The USTC-NERCSLIP System for the Track 1.2 of Audio Deepfake Detection (ADD 2023) Challenge.

H Wu, Z Li, L Xu, Z Zhang, W Zhao, B Gu, Y Ai, Y Lu… - DADA@ IJCAI, 2023 - ceur-ws.org
This paper describes the system of USTC-NERCSLIP submitted to the track 1.2 of the
second Audio Deepfake Detection Challenge (ADD 2023). Our system consists of a …

Attention-based conditioning methods using variable frame rate for style-robust speaker verification

A Afshan, A Alwan - arxiv preprint arxiv:2206.13680, 2022 - arxiv.org
We propose an approach to extract speaker embeddings that are robust to speaking style
variations in text-independent speaker verification. Typically, speaker embedding extraction …

Variable frame rate-based data augmentation to handle speaking-style variability for automatic speaker verification

A Afshan, J Guo, SJ Park, V Ravi, A McCree… - arxiv preprint arxiv …, 2020 - arxiv.org
The effects of speaking-style variability on automatic speaker verification were investigated
using the UCLA Speaker Variability database which comprises multiple speaking styles per …

A Simple Unsupervised Knowledge-Free Domain Adaptation for Speaker Recognition

W Lin, L Li, D Wang - Applied Sciences, 2024 - mdpi.com
Despite the great success of speaker recognition models based on deep neural networks,
deploying a pre-trained model in real-world scenarios often leads to significant performance …

SE/BN Adapter: Parametric Efficient Domain Adaptation for Speaker Recognition

T Wang, L Li, D Wang - arxiv preprint arxiv:2406.07832, 2024 - arxiv.org
Deploying a well-optimized pre-trained speaker recognition model in a new domain often
leads to a significant decline in performance. While fine-tuning is a commonly employed …

A principle solution for enroll-test mismatch in speaker recognition

L Li, D Wang, J Kang, R Wang, J Wu… - … /ACM Transactions on …, 2022 - ieeexplore.ieee.org
Mismatch between enrollment and test conditions causes serious performance degradation
on speaker recognition systems. This paper presents a statistics decomposition (SD) …

End-to-End Modeling for Speech Spoofing and Deepfake Detection

H Tak - 2023 - theses.hal.science
Voice biometric systems are being used in various applications for secure user
authentication using automatic speaker verification technology. However, these systems are …

Squeezing value of cross-domain labels: A decoupled scoring approach for speaker verification

L Li, Y Zhang, J Kang, TF Zheng… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Domain mismatch often occurs in real applications and causes serious performance
reduction on speaker verification systems. The common wisdom is to collect cross-domain …

Source-Free Domain Adaptation for Speaker Verification in Data-Scarce Languages and Noisy Channels

SS Elia, A Malachi, V Aharonson, G Pinkas - arxiv preprint arxiv …, 2024 - arxiv.org
Domain adaptation is often hampered by exceedingly small target datasets and inaccessible
source data. These conditions are prevalent in speech verification, where privacy policies …