An overview of text-independent speaker recognition: From features to supervectors

T Kinnunen, H Li - Speech communication, 2010 - Elsevier
This paper gives an overview of automatic speaker recognition technology, with an
emphasis on text-independent recognition. Speaker recognition has been studied actively …

Speaker identification features extraction methods: A systematic review

SS Tirumala, SR Shahamiri, AS Garhwal… - Expert Systems with …, 2017 - Elsevier
Speaker Identification (SI) is the process of identifying the speaker from a given utterance by
comparing the voice biometrics of the utterance with those utterance models stored …

Including signed languages in natural language processing

K Yin, A Moryossef, J Hochgesang, Y Goldberg… - arxiv preprint arxiv …, 2021 - arxiv.org
Signed languages are the primary means of communication for many deaf and hard of
hearing individuals. Since signed languages exhibit all the fundamental linguistic properties …

Deep belief networks based voice activity detection

XL Zhang, J Wu - IEEE Transactions on Audio, Speech, and …, 2012 - ieeexplore.ieee.org
Fusing the advantages of multiple acoustic features is important for the robustness of voice
activity detection (VAD). Recently, the machine-learning-based VADs have shown a …

Real-life voice activity detection with lstm recurrent neural networks and an application to hollywood movies

F Eyben, F Weninger, S Squartini… - 2013 IEEE International …, 2013 - ieeexplore.ieee.org
A novel, data-driven approach to voice activity detection is presented. The approach is
based on Long Short-Term Memory Recurrent Neural Networks trained on standard RASTA …

Voice activity detection. fundamentals and speech recognition system robustness

J Ramirez, JM Górriz, JC Segura - Robust speech recognition …, 2007 - books.google.com
An important drawback affecting most of the speech processing systems is the
environmental noise and its harmful effect on the system performance. Examples of such …

Unsupervised speech activity detection using voicing measures and perceptual spectral flux

SO Sadjadi, JHL Hansen - IEEE signal processing letters, 2013 - ieeexplore.ieee.org
Effective speech activity detection (SAD) is a necessary first step for robust speech
applications. In this letter, we propose a robust and unsupervised SAD solution that …

Robust voice activity detection using long-term signal variability

PK Ghosh, A Tsiartas… - IEEE Transactions on …, 2010 - ieeexplore.ieee.org
We propose a novel long-term signal variability (LTSV) measure, which describes the
degree of nonstationarity of the signal. We analyze the LTSV measure both analytically and …

The QUT-NOISE-TIMIT corpus for evaluation of voice activity detection algorithms

D Dean, S Sridharan, R Vogt… - Proceedings of the 11th …, 2010 - eprints.qut.edu.au
The QUT-NOISE-TIMIT corpus consists of 600 hours of noisy speech sequences designed to
enable a thorough evaluation of voice activity detection (VAD) algorithms across a wide …

Optimization of RNN-based speech activity detection

G Gelly, JL Gauvain - IEEE/ACM Transactions on Audio …, 2017 - ieeexplore.ieee.org
Speech activity detection (SAD) is an essential component of automatic speech recognition
systems impacting the overall system performance. This paper investigates an optimization …