- Academic Search

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier

Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

Enregistrer Citer Cité 441 fois Autres articles Les 9 versions Free GPT-4

Speaker identification through artificial intelligence techniques: A comprehensive review and research challenges

R Jahangir, YW Teh, HF Nweke, G Mujtaba… - Expert Systems with …, 2021 - Elsevier

Speech is a powerful medium of communication that always convey rich and useful
information, such as gender, accent, and other unique characteristics of a speaker. These …

Enregistrer Citer Cité 124 fois Autres articles Les 4 versions Free GPT-4

[Free GPT-4]

[PDF] researchgate.net

Speaker recognition from raw waveform with sincnet

M Ravanelli, Y Bengio - 2018 IEEE spoken language …, 2018 - ieeexplore.ieee.org

Deep learning is progressively gaining popularity as a viable alternative to i-vectors for
speaker recognition. Promising results have been recently obtained with Convolutional …

Enregistrer Citer Cité 1018 fois Autres articles Les 10 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

End-to-end spectro-temporal graph attention networks for speaker verification anti-spoofing and speech deepfake detection

H Tak, J Jung, J Patino, M Kamble, M Todisco… - arxiv preprint arxiv …, 2021 - arxiv.org

Artefacts that serve to distinguish bona fide speech from spoofed or deepfake speech are
known to reside in specific subbands and temporal segments. Various approaches can be …

Enregistrer Citer Cité 188 fois Autres articles Les 10 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Pushing the limits of raw waveform speaker recognition

J Jung, YJ Kim, HS Heo, BJ Lee, Y Kwon… - arxiv preprint arxiv …, 2022 - arxiv.org

In recent years, speaker recognition systems based on raw waveform inputs have received
increasing attention. However, the performance of such systems are typically inferior to the …

Enregistrer Citer Cité 105 fois Autres articles Les 11 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Improved rawnet with feature map scaling for text-independent speaker verification using raw waveforms

J Jung, S Kim, H Shim, J Kim, HJ Yu - arxiv preprint arxiv:2004.00526, 2020 - arxiv.org

Recent advances in deep learning have facilitated the design of speaker verification
systems that directly input raw waveforms. For example, RawNet extracts speaker …

Enregistrer Citer Cité 145 fois Autres articles Les 9 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Rawnet: Advanced end-to-end deep neural network using raw waveforms for text-independent speaker verification

J Jung, HS Heo, J Kim, H Shim, HJ Yu - arxiv preprint arxiv:1904.08104, 2019 - arxiv.org

Recently, direct modeling of raw waveforms using deep neural networks has been widely
studied for a number of tasks in audio domains. In speaker verification, however, utilization …

Enregistrer Citer Cité 175 fois Autres articles Les 8 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] researchgate.net

Interpretable convolutional filters with sincnet

M Ravanelli, Y Bengio - arxiv preprint arxiv:1811.09725, 2018 - arxiv.org

Deep learning is currently playing a crucial role toward higher levels of artificial intelligence.
This paradigm allows neural networks to learn complex and abstract representations, that …

Enregistrer Citer Cité 152 fois Autres articles Les 5 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Deep representation learning in speech processing: Challenges, recent advances, and future trends

S Latif, R Rana, S Khalifa, R Jurdak, J Qadir… - arxiv preprint arxiv …, 2020 - arxiv.org

Research on speech processing has traditionally considered the task of designing hand-
engineered acoustic features (feature engineering) as a separate distinct problem from the …

Enregistrer Citer Cité 116 fois Autres articles Les 3 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

A deep neural network for short-segment speaker recognition

A Hajavi, A Etemad - arxiv preprint arxiv:1907.10420, 2019 - arxiv.org

Todays interactive devices such as smart-phone assistants and smart speakers often deal
with short-duration speech segments. As a result, speaker recognition systems integrated …

Enregistrer Citer Cité 94 fois Autres articles Les 5 versions Free GPT-4 Version HTML

Créer l'alerte

Citer

Recherche avancée

Enregistré dans Ma bibliothèque

Avoiding speaker overfitting in end-to-end dnns using raw waveform for text-independent speaker...

Speaker recognition based on deep learning: An overview

Speaker identification through artificial intelligence techniques: A comprehensive review and research challenges

Speaker recognition from raw waveform with sincnet

End-to-end spectro-temporal graph attention networks for speaker verification anti-spoofing and speech deepfake detection

Pushing the limits of raw waveform speaker recognition

Improved rawnet with feature map scaling for text-independent speaker verification using raw waveforms

Rawnet: Advanced end-to-end deep neural network using raw waveforms for text-independent speaker verification

Interpretable convolutional filters with sincnet

Deep representation learning in speech processing: Challenges, recent advances, and future trends

A deep neural network for short-segment speaker recognition