Speaker recognition based on deep learning: An overview
Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …
Speaker identification through artificial intelligence techniques: A comprehensive review and research challenges
Speech is a powerful medium of communication that always conveys rich and useful
information, such as gender, accent, and other unique characteristics of a speaker. These …
Speaker recognition from raw waveform with SincNet
Deep learning is progressively gaining popularity as a viable alternative to i-vectors for
speaker recognition. Promising results have been recently obtained with Convolutional …
End-to-end spectro-temporal graph attention networks for speaker verification anti-spoofing and speech deepfake detection
Artefacts that serve to distinguish bona fide speech from spoofed or deepfake speech are
known to reside in specific subbands and temporal segments. Various approaches can be …
Pushing the limits of raw waveform speaker recognition
In recent years, speaker recognition systems based on raw waveform inputs have received
increasing attention. However, the performance of such systems is typically inferior to the …
Improved RawNet with feature map scaling for text-independent speaker verification using raw waveforms
Recent advances in deep learning have facilitated the design of speaker verification
systems that directly input raw waveforms. For example, RawNet extracts speaker …
RawNet: Advanced end-to-end deep neural network using raw waveforms for text-independent speaker verification
Recently, direct modeling of raw waveforms using deep neural networks has been widely
studied for a number of tasks in audio domains. In speaker verification, however, utilization …
Interpretable convolutional filters with SincNet
Deep learning is currently playing a crucial role toward higher levels of artificial intelligence.
This paradigm allows neural networks to learn complex and abstract representations, that …
Deep representation learning in speech processing: Challenges, recent advances, and future trends
Research on speech processing has traditionally considered the task of designing hand-
engineered acoustic features (feature engineering) as a separate distinct problem from the …
A deep neural network for short-segment speaker recognition
Today's interactive devices such as smart-phone assistants and smart speakers often deal
with short-duration speech segments. As a result, speaker recognition systems integrated …