- Academic Search

Speaker recognition based on deep learning: An overview

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier

Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

Cited by 445

Deep learning for biometrics: A survey

K Sundararajan, DL Woodard - ACM Computing Surveys (CSUR), 2018 - dl.acm.org

In the recent past, deep learning methods have demonstrated remarkable success for
supervised learning tasks in multiple domains including computer vision, natural language …

Cited by 344

[PDF] researchgate.net

Speaker recognition from raw waveform with sincnet

M Ravanelli, Y Bengio - 2018 IEEE spoken language …, 2018 - ieeexplore.ieee.org

Deep learning is progressively gaining popularity as a viable alternative to i-vectors for
speaker recognition. Promising results have been recently obtained with Convolutional …

Cited by 1025

[PDF] danielpovey.com

X-vectors: Robust dnn embeddings for speaker recognition

D Snyder, D Garcia-Romero, G Sell… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org

In this paper, we use data augmentation to improve performance of deep neural network
(DNN) embeddings for speaker recognition. The DNN, which is trained to discriminate …

Cited by 3435

[PDF] isca-archive.org

[PDF] Deep neural network embeddings for text-independent speaker verification

D Snyder, D Garcia-Romero, D Povey, S Khudanpur - Interspeech, 2017 - isca-archive.org

This paper investigates replacing i-vectors for text-independent speaker verification with
embeddings extracted from a feedforward deep neural network. Long-term speaker …

Cited by 1121

[PDF] danielpovey.com

Speaker recognition for multi-speaker conversations using x-vectors

D Snyder, D Garcia-Romero, G Sell… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org

Recently, deep neural networks that map utterances to fixed-dimensional embeddings have
emerged as the state-of-the-art in speaker recognition. Our prior work introduced x-vectors …

Cited by 396

[PDF] neurips.cc

Disentangling voice and content with self-supervision for speaker recognition

T Liu, KA Lee, Q Wang, H Li - Advances in Neural …, 2023 - proceedings.neurips.cc

For speaker recognition, it is difficult to extract an accurate speaker representation from
speech because of its mixture of speaker traits and content. This paper proposes a …

Cited by 30

[PDF] neurips.cc

Neural voice cloning with a few samples

S Arik, J Chen, K Peng, W Ping… - Advances in neural …, 2018 - proceedings.neurips.cc

Voice cloning is a highly desired feature for personalized speech interfaces. We introduce a
neural voice cloning system that learns to synthesize a person's voice from only a few audio …

Cited by 481

[PDF] arxiv.org

Deep speaker: an end-to-end neural speaker embedding system

C Li, X Ma, B Jiang, X Li, X Zhang, X Liu, Y Cao… - arXiv preprint arXiv …, 2017 - arxiv.org

We present Deep Speaker, a neural speaker embedding system that maps utterances to a
hypersphere where speaker similarity is measured by cosine similarity. The embeddings …

Cited by 594

[PDF] arxiv.org

Exploring the encoding layer and loss function in end-to-end speaker and language recognition system

W Cai, J Chen, M Li - arXiv preprint arXiv:1804.05160, 2018 - arxiv.org

In this paper, we explore the encoding/pooling layer and loss function in the end-to-end
speaker and language recognition system. First, a unified and interpretable end-to-end …

Cited by 417

Deep neural network-based speaker embeddings for end-to-end speaker verification

Speaker recognition based on deep learning: An overview