Overview of speaker modeling and its applications: From the lens of deep speaker representation learning
Speaker individuality information is among the most critical elements within speech signals.
By thoroughly and accurately modeling this information, it can be utilized in various …
By thoroughly and accurately modeling this information, it can be utilized in various …
Self-knowledge distillation via feature enhancement for speaker verification
As the most widely used technique, deep speaker embedding learning has become
predominant in speaker verification task recently. Very large neural networks such as …
predominant in speaker verification task recently. Very large neural networks such as …
Revisiting the statistics pooling layer in deep speaker embedding learning
The pooling function plays a vital role in the segment-level deep speaker embedding
learning framework. One common method is to calculate the statistics of the temporal …
learning framework. One common method is to calculate the statistics of the temporal …
[BOOK][B] Machine learning for speaker recognition
This book will help readers understand fundamental and advanced statistical models and
deep learning models for robust speaker recognition and domain adaptation. This useful …
deep learning models for robust speaker recognition and domain adaptation. This useful …
Depth-first neural architecture with attentive feature fusion for efficient speaker verification
Deep speaker embedding learning based on neural networks has become the predominant
approach in speaker verification (SV) currently. In prior studies, researchers have …
approach in speaker verification (SV) currently. In prior studies, researchers have …
Towards lightweight applications: Asymmetric enroll-verify structure for speaker verification
With the development of deep learning, automatic speaker verification has made
considerable progress over the past few years. However, to design a lightweight and robust …
considerable progress over the past few years. However, to design a lightweight and robust …
Binary neural network for speaker verification
[PDF][PDF] CS-CTCSCONV1D: Small footprint speaker verification with channel split time-channel-time separable 1-dimensional convolution.
We present an efficient small-footprint network for speaker verification. We start by
introducing the bottleneck to the QuartzNet model. Then we proposed a Channel Split Time …
introducing the bottleneck to the QuartzNet model. Then we proposed a Channel Split Time …
Open-set short utterance forensic speaker verification using teacher-student network with explicit inductive bias
In forensic applications, it is very common that only small naturalistic datasets consisting of
short utterances in complex or unknown acoustic environments are available. In this study …
short utterances in complex or unknown acoustic environments are available. In this study …