VoxSRC 2022: The fourth VoxCeleb speaker recognition challenge
This paper summarises the findings from the VoxCeleb Speaker Recognition Challenge
2022 (VoxSRC-22), which was held in conjunction with INTERSPEECH 2022. The goal of …
The VoxCeleb Speaker Recognition Challenge: A Retrospective
The VoxCeleb Speaker Recognition Challenges (VoxSRC) were a series of challenges and
workshops that ran annually from 2019 to 2023. The challenges primarily evaluated the …
Frame-wise and overlap-robust speaker embeddings for meeting diarization
Using a Teacher-Student training approach we developed a speaker embedding extraction
system that outputs embeddings at frame rate. Given this high temporal resolution and the …
The DKU-MSXF speaker verification system for the VoxCeleb speaker recognition challenge 2023
This paper is the system description of the DKU-MSXF system for Track 1, Track 2 and
Track 3 of the VoxCeleb Speaker Recognition Challenge 2023 (VoxSRC-23). For Track 1, we …
Haha-POD: An Attempt for Laughter-Based Non-Verbal Speaker Verification
It is widely acknowledged that discriminative representation for speaker verification can be
extracted from verbal speech. However, how much speaker information that non-verbal …
KunquDB: An attempt for speaker verification in the Chinese opera scenario
This work aims to promote Chinese opera research in both musical and speech domains,
with a primary focus on overcoming the data limitations. We introduce KunquDB …
Two-stage and self-supervised voice conversion for zero-shot dysarthric speech reconstruction
Dysarthria is a motor speech disorder commonly associated with conditions such as
cerebral palsy, Parkinson's disease, amyotrophic lateral sclerosis, and stroke. Individuals …
Distance Metric-Based Open-Set Domain Adaptation for Speaker Verification
J Li, J Han, F Qian, T Zheng, Y He… - IEEE/ACM Transactions …, 2024 - ieeexplore.ieee.org
Domain shift poses a significant challenge in speaker verification, especially in open-set
scenarios where the speaker categories are disjoint between the source and target …
Progressive sub-graph clustering algorithm for semi-supervised domain adaptation speaker verification
Utilizing the large-scale unlabeled data from the target domain via pseudo-label clustering
algorithms is an important approach for addressing domain adaptation problems in speaker …
Multi-Objective Progressive Clustering for Semi-Supervised Domain Adaptation in Speaker Verification
Utilizing the pseudo-labeling algorithm with large-scale unlabeled data becomes crucial for
semi-supervised domain adaptation in speaker verification tasks. In this paper, we propose …