VoxCeleb: Large-scale speaker verification in the wild
The objective of this work is speaker recognition under noisy and unconstrained conditions.
We make two key contributions. First, we introduce a very large-scale audio-visual dataset …
Optimization of data-driven filterbank for automatic speaker verification
Most of the speech processing applications use triangular filters spaced in mel-scale for
feature extraction. In this paper, we propose a new data-driven filter design method which …
Xi-vector embedding for speaker recognition
We present a Bayesian formulation for deep speaker embedding, wherein the xi-vector is
the Bayesian counterpart of the x-vector, taking into account the uncertainty estimate. On the …
Audio-visual speaker recognition with a cross-modal discriminative network
Audio-visual speaker recognition is one of the tasks in the recent 2019 NIST speaker
recognition evaluation (SRE). Studies in neuroscience and computer science all point to the …
VoxCeleb enrichment for age and gender recognition
VoxCeleb datasets are widely used in speaker recognition studies. Our work serves two
purposes. First, we provide speaker age labels and (an alternative) annotation of speaker …
A study of bias mitigation strategies for speaker recognition
Speaker recognition is increasingly used in several everyday applications including smart
speakers, customer care centers and other speech-driven analytics. It is crucial to accurately …
Towards robust speaker verification with target speaker enhancement
This paper proposes the target speaker enhancement based speaker verification network
(TASE-SVNet), an all neural model that couples target speaker enhancement and speaker …
An investigation of domain adaptation in speaker embedding space for speaker recognition
Speaker recognition continues to grow as a research challenge in the field with expanded
application in commercial, forensic, educational and general speech technology interfaces …
Incorporating uncertainty from speaker embedding estimation to speaker verification
Speech utterances recorded under differing conditions exhibit varying degrees of
confidence in their embedding estimates, i.e., uncertainty, even if they are extracted using the …
NEC-TT system for mixed-bandwidth and multi-domain speaker recognition
This paper describes the NEC-TT speaker recognition system designed for the 2018
Speaker Recognition Evaluation (SRE'18) benchmarking. The NEC-TT submission was …