Scoring of large-margin embeddings for speaker verification: Cosine or PLDA?

Q Wang, KA Lee, T Liu - arxiv preprint arxiv:2204.03965, 2022 - arxiv.org
The emergence of large-margin softmax cross-entropy losses in training deep speaker
embedding neural networks has triggered a gradual shift from parametric back-ends to a …

[HTML][HTML] Enhanced Indonesian ethnic speaker recognition using data augmentation deep neural network

K Nugroho, E Noersasongko - Journal of King Saud University-Computer …, 2022 - Elsevier
Speaker Recognition is a challenging topic in Speech Processing research area. The
various models proposed have succeeded in achieving a fairly high level of accuracy in this …

Cosine Scoring with Uncertainty for Neural Speaker Embedding

Q Wang, KA Lee - IEEE Signal Processing Letters, 2024 - ieeexplore.ieee.org
Uncertainty modeling in speaker representation aims to learn the variability present in
speech utterances. While the conventional cosine-scoring is computationally efficient and …

Effects of language mismatch in automatic forensic voice comparison using deep learning embeddings

D Sztahó, A Fejes - Journal of forensic sciences, 2023 - Wiley Online Library
In forensic voice comparison, deep learning has become widely popular recently. It is mainly
used to learn speaker representations, called embeddings or embedding vectors. Speaker …

A network model of speaker identification with new feature extraction methods and asymmetric BLSTM

X Wang, F Xue, W Wang, A Liu - Neurocomputing, 2020 - Elsevier
Speaker identification has recently attracted considerable attention in speaker recognition.
Environmental noise and short utterance pose two challenges for accurate speaker …

A deep learning approach for text-independent speaker recognition with short utterances

R Chakroun, M Frikha - Multimedia Tools and Applications, 2023 - Springer
Recently, the speaker recognition techniques have been widely attractive for their extensive
use in many fields, such as speech communications, domestic services, security and access …

Incorporating uncertainty from speaker embedding estimation to speaker verification

Q Wang, KA Lee, T Liu - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org
Speech utterances recorded under differing conditions exhibit varying degrees of
confidence in their embedding estimates, ie, uncertainty, even if they are extracted using the …

Data augmentation and deep neural networks for the classification of Pakistani racial speakers recognition

A Amjad, L Khan, HT Chang - PeerJ Computer Science, 2022 - peerj.com
Speech emotion recognition (SER) systems have evolved into an important method for
recognizing a person in several applications, including e-commerce, everyday interactions …

A comprehensive study on automatic speaker recognition by using deep learning techniques

VSR Gade, M Sumathi - 2021 5th International Conference on …, 2021 - ieeexplore.ieee.org
In Speaker, identifying or recognizing human voices is a challenging task. Recently, the
automatic speaker recognition technique has been developed by using deep learning …

Text-independent speaker recognition based on adaptive course learning loss and deep residual network

Q Zhong, R Dai, H Zhang, Y Zhu, G Zhou - EURASIP Journal on Advances …, 2021 - Springer
Text-independent speaker recognition is widely used in identity recognition that has a wide
spectrum of applications, such as criminal investigation, payment certification, and interest …