Rawboost: A raw data boosting and augmentation method applied to automatic speaker verification anti-spoofing
This paper introduces RawBoost, a data boosting and augmentation method for the design
of more reliable spoofing detection solutions which operate directly upon raw waveform …
of more reliable spoofing detection solutions which operate directly upon raw waveform …
[PDF][PDF] The USTC-NERCSLIP System for the Track 1.2 of Audio Deepfake Detection (ADD 2023) Challenge.
This paper describes the system of USTC-NERCSLIP submitted to the track 1.2 of the
second Audio Deepfake Detection Challenge (ADD 2023). Our system consists of a …
second Audio Deepfake Detection Challenge (ADD 2023). Our system consists of a …
Attention-based conditioning methods using variable frame rate for style-robust speaker verification
We propose an approach to extract speaker embeddings that are robust to speaking style
variations in text-independent speaker verification. Typically, speaker embedding extraction …
variations in text-independent speaker verification. Typically, speaker embedding extraction …
Variable frame rate-based data augmentation to handle speaking-style variability for automatic speaker verification
The effects of speaking-style variability on automatic speaker verification were investigated
using the UCLA Speaker Variability database which comprises multiple speaking styles per …
using the UCLA Speaker Variability database which comprises multiple speaking styles per …
A Simple Unsupervised Knowledge-Free Domain Adaptation for Speaker Recognition
Despite the great success of speaker recognition models based on deep neural networks,
deploying a pre-trained model in real-world scenarios often leads to significant performance …
deploying a pre-trained model in real-world scenarios often leads to significant performance …
SE/BN Adapter: Parametric Efficient Domain Adaptation for Speaker Recognition
Deploying a well-optimized pre-trained speaker recognition model in a new domain often
leads to a significant decline in performance. While fine-tuning is a commonly employed …
leads to a significant decline in performance. While fine-tuning is a commonly employed …
A principle solution for enroll-test mismatch in speaker recognition
Mismatch between enrollment and test conditions causes serious performance degradation
on speaker recognition systems. This paper presents a statistics decomposition (SD) …
on speaker recognition systems. This paper presents a statistics decomposition (SD) …
End-to-End Modeling for Speech Spoofing and Deepfake Detection
H Tak - 2023 - theses.hal.science
Voice biometric systems are being used in various applications for secure user
authentication using automatic speaker verification technology. However, these systems are …
authentication using automatic speaker verification technology. However, these systems are …
Squeezing value of cross-domain labels: A decoupled scoring approach for speaker verification
Domain mismatch often occurs in real applications and causes serious performance
reduction on speaker verification systems. The common wisdom is to collect cross-domain …
reduction on speaker verification systems. The common wisdom is to collect cross-domain …
Source-Free Domain Adaptation for Speaker Verification in Data-Scarce Languages and Noisy Channels
Domain adaptation is often hampered by exceedingly small target datasets and inaccessible
source data. These conditions are prevalent in speech verification, where privacy policies …
source data. These conditions are prevalent in speech verification, where privacy policies …