An overview of text-independent speaker recognition: From features to supervectors
This paper gives an overview of automatic speaker recognition technology, with an
emphasis on text-independent recognition. Speaker recognition has been studied actively …
emphasis on text-independent recognition. Speaker recognition has been studied actively …
Speaker identification features extraction methods: A systematic review
Speaker Identification (SI) is the process of identifying the speaker from a given utterance by
comparing the voice biometrics of the utterance with those utterance models stored …
comparing the voice biometrics of the utterance with those utterance models stored …
Including signed languages in natural language processing
Signed languages are the primary means of communication for many deaf and hard of
hearing individuals. Since signed languages exhibit all the fundamental linguistic properties …
hearing individuals. Since signed languages exhibit all the fundamental linguistic properties …
Deep belief networks based voice activity detection
Fusing the advantages of multiple acoustic features is important for the robustness of voice
activity detection (VAD). Recently, the machine-learning-based VADs have shown a …
activity detection (VAD). Recently, the machine-learning-based VADs have shown a …
Real-life voice activity detection with lstm recurrent neural networks and an application to hollywood movies
A novel, data-driven approach to voice activity detection is presented. The approach is
based on Long Short-Term Memory Recurrent Neural Networks trained on standard RASTA …
based on Long Short-Term Memory Recurrent Neural Networks trained on standard RASTA …
Voice activity detection. fundamentals and speech recognition system robustness
An important drawback affecting most of the speech processing systems is the
environmental noise and its harmful effect on the system performance. Examples of such …
environmental noise and its harmful effect on the system performance. Examples of such …
Unsupervised speech activity detection using voicing measures and perceptual spectral flux
Effective speech activity detection (SAD) is a necessary first step for robust speech
applications. In this letter, we propose a robust and unsupervised SAD solution that …
applications. In this letter, we propose a robust and unsupervised SAD solution that …
Robust voice activity detection using long-term signal variability
We propose a novel long-term signal variability (LTSV) measure, which describes the
degree of nonstationarity of the signal. We analyze the LTSV measure both analytically and …
degree of nonstationarity of the signal. We analyze the LTSV measure both analytically and …
The QUT-NOISE-TIMIT corpus for evaluation of voice activity detection algorithms
The QUT-NOISE-TIMIT corpus consists of 600 hours of noisy speech sequences designed to
enable a thorough evaluation of voice activity detection (VAD) algorithms across a wide …
enable a thorough evaluation of voice activity detection (VAD) algorithms across a wide …
Optimization of RNN-based speech activity detection
Speech activity detection (SAD) is an essential component of automatic speech recognition
systems impacting the overall system performance. This paper investigates an optimization …
systems impacting the overall system performance. This paper investigates an optimization …