Speaker recognition based on deep learning: An overview
Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …
learning has dramatically revolutionized speaker recognition. However, there is lack of …
Deep speaker embeddings for Speaker Verification: Review and experimental comparison
The construction of speaker-specific acoustic models for automatic speaker recognition is
almost exclusively based on deep neural network-based speaker embeddings. This work …
almost exclusively based on deep neural network-based speaker embeddings. This work …
Multi-view self-attention based transformer for speaker recognition
Initially developed for natural language processing (NLP), Transformer model is now widely
used for speech processing tasks such as speaker recognition, due to its powerful sequence …
used for speech processing tasks such as speaker recognition, due to its powerful sequence …
MEConformer: Highly representative embedding extractor for speaker verification via incorporating selective convolution into deep speaker encoder
Transformer models have demonstrated superior performance across various domains,
including computer vision, natural language processing, and speech recognition. The …
including computer vision, natural language processing, and speech recognition. The …
Audio deepfake detection system with neural stitching for add 2022
This paper describes our best system and methodology for ADD 2022: The First Audio Deep
Synthesis Detection Challenge [1]. The very same system was used for both two rounds of …
Synthesis Detection Challenge [1]. The very same system was used for both two rounds of …
[PDF][PDF] Branch-ECAPA-TDNN: A parallel branch architecture to capture local and global features for speaker verification
Currently, ECAPA-TDNN is one of the state-of-the-art deep models for automatic speaker
verification (ASV). However, it focuses too much on local feature extraction with fixed local …
verification (ASV). However, it focuses too much on local feature extraction with fixed local …
[HTML][HTML] Causal reasoning for algorithmic fairness in voice controlled cyber-physical systems
Automated speaker recognition is enabling personalized interactions with the voice-based
interfaces and assistants part of the modern cyber-physical-social systems. Prior studies …
interfaces and assistants part of the modern cyber-physical-social systems. Prior studies …
Time-domain speaker verification using temporal convolutional networks
Recently, speaker verification systems using deep neural networks have been widely
studied. Many of them utilize hand-crafted features such as mel-filterbank energies, mel …
studied. Many of them utilize hand-crafted features such as mel-filterbank energies, mel …
Global–local self-attention based transformer for speaker verification
F **e, D Zhang, C Liu - Applied Sciences, 2022 - mdpi.com
Transformer models are now widely used for speech processing tasks due to their powerful
sequence modeling capabilities. Previous work determined an efficient way to model …
sequence modeling capabilities. Previous work determined an efficient way to model …
Dictionary attacks on speaker verification
In this paper, we propose dictionary attacks against speaker verification-a novel attack
vector that aims to match a large fraction of speaker population by chance. We introduce a …
vector that aims to match a large fraction of speaker population by chance. We introduce a …