Speaker recognition based on deep learning: An overview

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier
Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is a lack of …

Speaker identification through artificial intelligence techniques: A comprehensive review and research challenges

R Jahangir, YW Teh, HF Nweke, G Mujtaba… - Expert Systems with …, 2021 - Elsevier
Speech is a powerful medium of communication that always conveys rich and useful
information, such as gender, accent, and other unique characteristics of a speaker. These …

Speaker recognition from raw waveform with SincNet

M Ravanelli, Y Bengio - 2018 IEEE spoken language …, 2018 - ieeexplore.ieee.org
Deep learning is progressively gaining popularity as a viable alternative to i-vectors for
speaker recognition. Promising results have been recently obtained with Convolutional …

End-to-end spectro-temporal graph attention networks for speaker verification anti-spoofing and speech deepfake detection

H Tak, J Jung, J Patino, M Kamble, M Todisco… - arxiv preprint arxiv …, 2021 - arxiv.org
Artefacts that serve to distinguish bona fide speech from spoofed or deepfake speech are
known to reside in specific subbands and temporal segments. Various approaches can be …

Pushing the limits of raw waveform speaker recognition

J Jung, YJ Kim, HS Heo, BJ Lee, Y Kwon… - arxiv preprint arxiv …, 2022 - arxiv.org
In recent years, speaker recognition systems based on raw waveform inputs have received
increasing attention. However, the performance of such systems is typically inferior to the …

Improved RawNet with feature map scaling for text-independent speaker verification using raw waveforms

J Jung, S Kim, H Shim, J Kim, HJ Yu - arxiv preprint arxiv:2004.00526, 2020 - arxiv.org
Recent advances in deep learning have facilitated the design of speaker verification
systems that directly input raw waveforms. For example, RawNet extracts speaker …

RawNet: Advanced end-to-end deep neural network using raw waveforms for text-independent speaker verification

J Jung, HS Heo, J Kim, H Shim, HJ Yu - arxiv preprint arxiv:1904.08104, 2019 - arxiv.org
Recently, direct modeling of raw waveforms using deep neural networks has been widely
studied for a number of tasks in audio domains. In speaker verification, however, utilization …

Interpretable convolutional filters with SincNet

M Ravanelli, Y Bengio - arxiv preprint arxiv:1811.09725, 2018 - arxiv.org
Deep learning is currently playing a crucial role toward higher levels of artificial intelligence.
This paradigm allows neural networks to learn complex and abstract representations that …

Deep representation learning in speech processing: Challenges, recent advances, and future trends

S Latif, R Rana, S Khalifa, R Jurdak, J Qadir… - arxiv preprint arxiv …, 2020 - arxiv.org
Research on speech processing has traditionally considered the task of designing hand-
engineered acoustic features (feature engineering) as a separate distinct problem from the …

A deep neural network for short-segment speaker recognition

A Hajavi, A Etemad - arxiv preprint arxiv:1907.10420, 2019 - arxiv.org
Today's interactive devices such as smart-phone assistants and smart speakers often deal
with short-duration speech segments. As a result, speaker recognition systems integrated …