Speaker recognition based on deep learning: An overview

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier
Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

A survey on text-dependent and text-independent speaker verification

Y Tu, W Lin, MW Mak - IEEE Access, 2022 - ieeexplore.ieee.org
Speaker verification (SV) aims to detect an individual's identity from his/her voice. SV has
been successfully applied in various areas such as access control, remote service …

Disentangling voice and content with self-supervision for speaker recognition

T Liu, KA Lee, Q Wang, H Li - Advances in Neural …, 2023 - proceedings.neurips.cc
For speaker recognition, it is difficult to extract an accurate speaker representation from
speech because of its mixture of speaker traits and content. This paper proposes a …

The INTERSPEECH 2020 far-field speaker verification challenge

X Qin, M Li, H Bu, W Rao, RK Das… - arxiv preprint arxiv …, 2020 - arxiv.org
The INTERSPEECH 2020 Far-Field Speaker Verification Challenge (FFSVC 2020)
addresses three different research problems under well-defined conditions: far-field text …

Multi-resolution multi-head attention in deep speaker embedding

Z Wang, K Yao, X Li, S Fang - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
Pooling is an essential component to capture long-term speaker characteristics for speaker
recognition. This paper proposes simple but effective pooling methods to compute attentive …

Overview of speaker modeling and its applications: From the lens of deep speaker representation learning

S Wang, Z Chen, KA Lee, Y Qian… - IEEE/ACM Transactions …, 2024 - ieeexplore.ieee.org
Speaker individuality information is among the most critical elements within speech signals.
By thoroughly and accurately modeling this information, it can be utilized in various …

Within-sample variability-invariant loss for robust speaker recognition under noisy environments

D Cai, W Cai, M Li - ICASSP 2020-2020 IEEE International …, 2020 - ieeexplore.ieee.org
Despite the significant improvements in speaker recognition enabled by deep neural
networks, unsatisfactory performance persists under noisy environments. In this paper, we …

An empirical study on channel effects for synthetic voice spoofing countermeasure systems

Y Zhang, G Zhu, F Jiang, Z Duan - arxiv preprint arxiv:2104.01320, 2021 - arxiv.org
Spoofing countermeasure (CM) systems are critical in speaker verification; they aim to
discern spoofing attacks from bona fide speech trials. In practice, however, acoustic …

Adapting deep learning models to new acoustic environments-a case study on the north atlantic right whale upcall

B Padovese, OS Kirsebom, F Frazao, CHM Evers… - Ecological …, 2023 - Elsevier
Passive acoustic monitoring is increasingly being used for studying marine mammals,
leading to the accumulation of large acoustic datasets. Analyzing these datasets becomes …

Speaker adaptation for attention-based end-to-end speech recognition

Z Meng, Y Gaur, J Li, Y Gong - arxiv preprint arxiv:1911.03762, 2019 - arxiv.org
We propose three regularization-based speaker adaptation approaches to adapt the
attention-based encoder-decoder (AED) model with very limited adaptation data from target …