Build a sre challenge system: Lessons from voxsrc 2022 and cnsrc 2022

Z Chen, B Han, X **ang, H Huang, B Liu… - arxiv preprint arxiv …, 2022 - arxiv.org
Many speaker recognition challenges have been held to assess the speaker verification
system in the wild and probe the performance limit. Voxceleb Speaker Recognition …

Recursive Attentive Pooling For Extracting Speaker Embeddings From Multi-Speaker Recordings

S Horiguchi, A Ando, T Moriya… - 2024 IEEE Spoken …, 2024 - ieeexplore.ieee.org
This paper proposes a method for extracting speaker embedding for each speaker from a
variable-length recording containing multiple speakers. Speaker embeddings are crucial not …

Hybrid network with multi-level global-local statistics pooling for robust text-independent speaker recognition

WH Kang, J Alam, A Fathan - 2021 IEEE Automatic Speech …, 2021 - ieeexplore.ieee.org
In this paper, we propose a new hybrid system for extracting a speaker embedding vector.
More specifically, the proposed system employs a multi-level global-local statistics pooling …

Hybrid neural network with cross-and self-module attention pooling for text-independent speaker verification

J Alam, WH Kang, A Fathan - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
Extraction of a speaker embedding vector plays an important role in deep learning-based
speaker verification. In this contribution, to extract speaker discriminant utterance level …

Synaug: Synthesis-based data augmentation for text-dependent speaker verification

C Du, B Han, S Wang, Y Qian… - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Text-dependent speaker verification systems trained on large amount of labelled data
exhibit remarkable performance. However, collecting the speech from a lot of speakers with …

On the use of cross-and self-module attentive statistics pooling techniques for text-independent speaker verification

J Alam - 2023 IEEE International Joint Conference on …, 2023 - ieeexplore.ieee.org
In neural speaker verification, statistics pooling plays a key role in the learning and
extraction of a speaker embedding vector. In this contribution, we perform an investigative …

Dasa: Difficulty-aware semantic augmentation for speaker verification

Y Wang, Y Zhang, Z Wu, Z Yang, T Wei… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Data augmentation is vital to the generalization ability and robustness of deep neural
networks (DNNs) models. Existing augmentation methods for speaker verification …

Unit selection synthesis based data augmentation for fixed phrase speaker verification

H Huang, X **ang, F Zhao, S Wang… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Data augmentation is commonly used to help build a robust speaker verification system,
especially in limited-resource case. However, conventional data augmentation methods …

[PDF][PDF] Hybrid neural network-based deep embedding extractors for text-independent speaker verification

J Alam, WH Kang, A Fathan - extraction, 2022 - isca-archive.org
In this contribution, we propose a multi-stream hybrid neural network for extracting speaker
discriminant utterance-level embedding vectors. In this approach, an input acoustic feature …

[PDF][PDF] Investigation on Deep Speaker Embedding Extraction Methods for Multi-Genre Speaker Verification.

WH Kang, J Alam - Odyssey, 2022 - isca-archive.org
In this paper, we provide description of our experimented systems on the CNCeleb dataset.
The CNCeleb dataset provides a difficult set of trial that were collected from multiple genres …