Deep learning for environmentally robust speech recognition: An overview of recent developments

Z Zhang, J Geiger, J Pohjalainen, AED Mousa… - ACM Transactions on …, 2018 - dl.acm.org
Eliminating the negative effect of non-stationary environmental noise is a long-standing
research topic for automatic speech recognition but still remains an important challenge …

Acoustic vector sensor: reviews and future perspectives

J Cao, J Liu, J Wang, X Lai - IET Signal Processing, 2017 - Wiley Online Library
Acoustic vector sensor (AVS) has been recently researched and developed for acoustic
wave capturing and signal processing. Conventional array generally employs spatially …

A consolidated perspective on multimicrophone speech enhancement and source separation

S Gannot, E Vincent… - … /ACM Transactions on …, 2017 - ieeexplore.ieee.org
Speech enhancement and separation are core problems in audio signal processing, with
commercial applications in devices as diverse as mobile phones, conference call systems …

[BOOK][B] Fundamentals of signal enhancement and array signal processing

J Benesty, I Cohen, J Chen - 2017 - books.google.com
A comprehensive guide to the theory and practice of signal enhancement and array signal
processing, including MATLAB codes, exercises and instructor and solution manuals …

[BOOK][B] Theory and applications of spherical microphone array processing

DP Jarrett, EAP Habets, PA Naylor - 2017 - Springer
The topic of spherical microphone array signal processing has been gaining importance
since the publications of Meyer and Elko around 2002, and fuelled by many others since …

Coherent-to-diffuse power ratio estimation for dereverberation

A Schwarz, W Kellermann - IEEE/ACM Transactions on Audio …, 2015 - ieeexplore.ieee.org
The estimation of the time- and frequency-dependent coherent-to-diffuse power ratio (CDR)
from the measured spatial coherence between two omnidirectional microphones is …

Microphone array processing for distant speech recognition: From close-talking microphones to far-field sensors

K Kumatani, J McDonough, B Raj - IEEE Signal Processing …, 2012 - ieeexplore.ieee.org
Distant speech recognition (DSR) holds the promise of the most natural human computer
interface because it enables man-machine interactions through speech, without the …

Recent developments in speech enhancement in the short-time Fourier transform domain

M Parchami, WP Zhu, B Champagne… - IEEE Circuits and …, 2016 - ieeexplore.ieee.org
In this paper, we present an overview on the topic of noise reduction in the short-time Fourier
transform (STFT) domain. First, we briefly review the conventional literature in the single- and …

Evaluation and comparison of late reverberation power spectral density estimators

S Braun, A Kuklasiński, O Schwartz… - … on Audio, Speech …, 2018 - ieeexplore.ieee.org
Reduction of late reverberation can be achieved using spatio-spectral filters, such as the
multichannel Wiener filter. To compute this filter, an estimate of the late reverberation power …

A dual-microphone speech enhancement algorithm based on the coherence function

N Yousefian, PC Loizou - IEEE Transactions on Audio, Speech …, 2011 - ieeexplore.ieee.org
A novel dual-microphone speech enhancement technique is proposed in the present paper.
The technique utilizes the coherence between the target and noise signals as a criterion for …