Overview of speaker modeling and its applications: From the lens of deep speaker representation learning

S Wang, Z Chen, KA Lee, Y Qian… - IEEE/ACM Transactions …, 2024 - ieeexplore.ieee.org
Speaker individuality information is among the most critical elements within speech signals.
By thoroughly and accurately modeling this information, it can be utilized in various …

Self-knowledge distillation via feature enhancement for speaker verification

B Liu, H Wang, Z Chen, S Wang… - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
As the most widely used technique, deep speaker embedding learning has become
predominant in speaker verification task recently. Very large neural networks such as …

Revisiting the statistics pooling layer in deep speaker embedding learning

S Wang, Y Yang, Y Qian, K Yu - 2021 12th International …, 2021 - ieeexplore.ieee.org
The pooling function plays a vital role in the segment-level deep speaker embedding
learning framework. One common method is to calculate the statistics of the temporal …

[BOOK][B] Machine learning for speaker recognition

MW Mak, JT Chien - 2020 - books.google.com
This book will help readers understand fundamental and advanced statistical models and
deep learning models for robust speaker recognition and domain adaptation. This useful …

Depth-first neural architecture with attentive feature fusion for efficient speaker verification

B Liu, Z Chen, Y Qian - IEEE/ACM Transactions on Audio …, 2023 - ieeexplore.ieee.org
Deep speaker embedding learning based on neural networks has become the predominant
approach in speaker verification (SV) currently. In prior studies, researchers have …

Towards lightweight applications: Asymmetric enroll-verify structure for speaker verification

Q Li, L Yang, X Wang, X Qin, J Wang… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
With the development of deep learning, automatic speaker verification has made
considerable progress over the past few years. However, to design a lightweight and robust …

Binary neural network for speaker verification

T Zhu, X Qin, M Li - ar** a lightweight speaker embedding extractor (SEE) is crucial for the practical
implementation of automatic speaker verification (ASV) systems. To this end, we recently …

[PDF][PDF] CS-CTCSCONV1D: Small footprint speaker verification with channel split time-channel-time separable 1-dimensional convolution.

L Cai, Y Yang, X Chen, W Tu, H Chen - INTERSPEECH, 2022 - isca-archive.org
We present an efficient small-footprint network for speaker verification. We start by
introducing the bottleneck to the QuartzNet model. Then we proposed a Channel Split Time …

Open-set short utterance forensic speaker verification using teacher-student network with explicit inductive bias

M Sang, W **a, JHL Hansen - arxiv preprint arxiv:2009.09556, 2020 - arxiv.org
In forensic applications, it is very common that only small naturalistic datasets consisting of
short utterances in complex or unknown acoustic environments are available. In this study …