VoxSRC 2022: The fourth VoxCeleb speaker recognition challenge

J Huh, A Brown, J Jung, JS Chung, A Nagrani… - arXiv preprint arXiv …, 2023 - arxiv.org
This paper summarises the findings from the VoxCeleb Speaker Recognition Challenge
2022 (VoxSRC-22), which was held in conjunction with INTERSPEECH 2022. The goal of …

The VoxCeleb Speaker Recognition Challenge: A Retrospective

J Huh, JS Chung, A Nagrani, A Brown… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org
The VoxCeleb Speaker Recognition Challenges (VoxSRC) were a series of challenges and
workshops that ran annually from 2019 to 2023. The challenges primarily evaluated the …

Frame-wise and overlap-robust speaker embeddings for meeting diarization

T Cord-Landwehr, C Boeddeker… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Using a Teacher-Student training approach we developed a speaker embedding extraction
system that outputs embeddings at frame rate. Given this high temporal resolution and the …

The DKU-MSXF speaker verification system for the VoxCeleb speaker recognition challenge 2023

Z Li, Y Lin, X Qin, N Jiang, G Zhao, M Li - arXiv preprint arXiv:2308.08766, 2023 - arxiv.org
This paper is the system description of the DKU-MSXF System for Track 1, Track 2 and
Track 3 of the VoxCeleb Speaker Recognition Challenge 2023 (VoxSRC-23). For Track 1, we …

Haha-POD: An Attempt for Laughter-Based Non-Verbal Speaker Verification

Y Lin, X Qin, N Jiang, G Zhao… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org
It is widely acknowledged that discriminative representation for speaker verification can be
extracted from verbal speech. However, how much speaker information that non-verbal …

KunquDB: An attempt for speaker verification in the Chinese opera scenario

H Zhou, Y Lin, D Liu, M Li - International Conference on Pattern …, 2024 - Springer
This work aims to promote Chinese opera research in both musical and speech domains,
with a primary focus on overcoming the data limitations. We introduce KunquDB …

Two-stage and self-supervised voice conversion for zero-shot dysarthric speech reconstruction

D Liu, Y Lin, H Bu, M Li - 2024 International Conference on …, 2024 - ieeexplore.ieee.org
Dysarthria is a motor speech disorder commonly associated with conditions such as
cerebral palsy, Parkinson's disease, amyotrophic lateral sclerosis, and stroke. Individuals …

Distance Metric-Based Open-Set Domain Adaptation for Speaker Verification

J Li, J Han, F Qian, T Zheng, Y He… - IEEE/ACM Transactions …, 2024 - ieeexplore.ieee.org
Domain shift poses a significant challenge in speaker verification, especially in open-set
scenarios where the speaker categories are disjoint between the source and target …

Progressive sub-graph clustering algorithm for semi-supervised domain adaptation speaker verification

Z Li, J Lu, Z Zhao, W Wang, M Wang… - … Conference on Signal …, 2024 - ieeexplore.ieee.org
Utilizing the large-scale unlabeled data from the target domain via pseudo-label clustering
algorithms is an important approach for addressing domain adaptation problems in speaker …

Multi-Objective Progressive Clustering for Semi-Supervised Domain Adaptation in Speaker Verification

Z Li, Y Lin, N Jiang, X Qin, G Zhao… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
Utilizing the pseudo-labeling algorithm with large-scale unlabeled data becomes crucial for
semi-supervised domain adaptation in speaker verification tasks. In this paper, we propose …