Recent developments in speech enhancement in the short-time Fourier transform domain

M Parchami, WP Zhu, B Champagne… - IEEE Circuits and …, 2016 - ieeexplore.ieee.org
In this paper, we present an overview on the topic of noise reduction in the short-time Fourier
transform (STFT) domain. First, we briefly review the conventional literature in the single-and …

A comprehensive empirical review of modern voice activity detection approaches for movies and TV shows

M Sharma, S Joshi, T Chatterjee, R Hamid - Neurocomputing, 2022 - Elsevier
A robust and language agnostic Voice Activity Detection (VAD) is crucial for Digital
Entertainment Content (DEC). Primary examples of DEC include movies and TV series …

Boosting contextual information for deep neural network based voice activity detection

XL Zhang, DL Wang - IEEE/ACM Transactions on Audio …, 2015 - ieeexplore.ieee.org
Voice activity detection (VAD) is an important topic in audio signal processing. Contextual
information is important for improving the performance of VAD at low signal-to-noise ratios …

CNN-based speech segments endpoints detection framework using short-time signal energy features

G Ahmed, AA Lawaye - International Journal of Information Technology, 2023 - Springer
Abstract The quality of Speech Recognition systems has improved, with a shift focus from
short utterance scenarios like Voice Assistants and Voice Search to extended utterance …

[PDF][PDF] rtCaptcha: A Real-Time CAPTCHA Based Liveness Detection System.

E Uzun, SPH Chung, I Essa, W Lee - NDSS, 2018 - ndss-symposium.org
Facial/voice-based authentication is becoming increasingly popular (eg, already adopted by
MasterCard and AliPay), because it is easy to use. In particular, users can now authenticate …

Acoustic feature based unsupervised approach of heart sound event detection

S Das, S Pal, M Mitra - Computers in biology and medicine, 2020 - Elsevier
This paper represents an unsupervised approach to detect the positions of S1, S2 heart
sound events in a Phonocardiogram (PCG) recording. Insufficiency of correctly annotated …

[PDF][PDF] Boosted deep neural networks and multi-resolution cochleagram features for voice activity detection.

XL Zhang, DL Wang - INTERSPEECH, 2014 - xiaolei-zhang.net
Voice activity detection (VAD) is an important frontend of many speech processing systems.
In this paper, we describe a new VAD algorithm based on boosted deep neural networks …

Formant-based robust voice activity detection

IC Yoo, H Lim, D Yook - IEEE/ACM Transactions on audio …, 2015 - ieeexplore.ieee.org
Voice activity detection (VAD) can be used to distinguish human speech from other sounds,
and various applications can benefit from VAD-including speech coding and speech …

Audio-visual voice activity detection using diffusion maps

D Dov, R Talmon, I Cohen - IEEE/ACM Transactions on Audio …, 2015 - ieeexplore.ieee.org
The performance of traditional voice activity detectors significantly deteriorates in the
presence of highly nonstationary noise and transient interferences. One solution is to …

Voice activity detection for transient noisy environment based on diffusion nets

A Ivry, B Berdugo, I Cohen - IEEE Journal of Selected Topics in …, 2019 - ieeexplore.ieee.org
We address voice activity detection in acoustic environments of transients and stationary
noises, which often occur in real-life scenarios. We exploit unique spatial patterns of speech …