Deep belief networks based voice activity detection
Fusing the advantages of multiple acoustic features is important for the robustness of voice
activity detection (VAD). Recently, the machine-learning-based VADs have shown a …
activity detection (VAD). Recently, the machine-learning-based VADs have shown a …
Hotword detection on multiple devices
Methods, systems, and apparatus, including computer programs encoded on a computer
storage medium, for hotword detection on multiple devices are disclosed. In one aspect, a …
storage medium, for hotword detection on multiple devices are disclosed. In one aspect, a …
Speaker verification using co-location information
Methods, systems, and apparatus, including computer pro grams encoded on computer
storage media, for identifying a user in a multi-user environment. One of the methods …
storage media, for identifying a user in a multi-user environment. One of the methods …
Real-life voice activity detection with lstm recurrent neural networks and an application to hollywood movies
A novel, data-driven approach to voice activity detection is presented. The approach is
based on Long Short-Term Memory Recurrent Neural Networks trained on standard RASTA …
based on Long Short-Term Memory Recurrent Neural Networks trained on standard RASTA …
Voice activity detection. fundamentals and speech recognition system robustness
An important drawback affecting most of the speech processing systems is the
environmental noise and its harmful effect on the system performance. Examples of such …
environmental noise and its harmful effect on the system performance. Examples of such …
A convolutional neural network smartphone app for real-time voice activity detection
This paper presents a smartphone app that performs real-time voice activity detection based
on convolutional neural network. Real-time implementation issues are discussed showing …
on convolutional neural network. Real-time implementation issues are discussed showing …
Unsupervised speech activity detection using voicing measures and perceptual spectral flux
Effective speech activity detection (SAD) is a necessary first step for robust speech
applications. In this letter, we propose a robust and unsupervised SAD solution that …
applications. In this letter, we propose a robust and unsupervised SAD solution that …
Boosting contextual information for deep neural network based voice activity detection
Voice activity detection (VAD) is an important topic in audio signal processing. Contextual
information is important for improving the performance of VAD at low signal-to-noise ratios …
information is important for improving the performance of VAD at low signal-to-noise ratios …
Collaborative voice controlled devices
Methods, systems, and apparatus, including computer pro grams encoded on a computer
storage medium, for collabo ration between multiple voice controlled devices are dis closed …
storage medium, for collabo ration between multiple voice controlled devices are dis closed …
Features for voice activity detection: a comparative analysis
In many speech signal processing applications, voice activity detection (VAD) plays an
essential role for separating an audio stream into time intervals that contain speech activity …
essential role for separating an audio stream into time intervals that contain speech activity …