Sixty years of frequency-domain monaural speech enhancement: From traditional to deep learning methods

C Zheng, H Zhang, W Liu, X Luo, A Li, X Li… - Trends in …, 2023 - journals.sagepub.com
Frequency-domain monaural speech enhancement has been extensively studied for over
60 years, and a great number of methods have been proposed and applied to many …

Speech enhancement and dereverberation with diffusion-based generative models

J Richter, S Welker, JM Lemercier… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org
In this work, we build upon our previous publication and use diffusion-based generative
models for speech enhancement. We present a detailed overview of the diffusion process …

Recent developments in speech enhancement in the short-time Fourier transform domain

M Parchami, WP Zhu, B Champagne… - IEEE Circuits and …, 2016 - ieeexplore.ieee.org
In this paper, we present an overview on the topic of noise reduction in the short-time Fourier
transform (STFT) domain. First, we briefly review the conventional literature in the single-and …

Estimators of the magnitude-squared spectrum and methods for incorporating SNR uncertainty

Y Lu, PC Loizou - IEEE transactions on audio, speech, and …, 2010 - ieeexplore.ieee.org
Statistical estimators of the magnitude-squared spectrum are derived based on the
assumption that the magnitude-squared spectrum of the noisy speech signal can be …

A multi-frame approach to the frequency-domain single-channel noise reduction problem

YA Huang, J Benesty - IEEE Transactions on Audio, Speech …, 2011 - ieeexplore.ieee.org
This paper focuses on the class of single-channel noise reduction methods that are
performed in the frequency domain via the short-time Fourier transform (STFT). The …

Tracking of nonstationary noise based on data-driven recursive noise power estimation

JS Erkelens, R Heusdens - IEEE transactions on audio, speech …, 2008 - ieeexplore.ieee.org
This paper considers estimation of the noise spectral variance from speech signals
contaminated by highly nonstationary noise sources. The method can accurately track fast …

MMSE-optimal spectral amplitude estimation given the STFT-phase

T Gerkmann, M Krawczyk - IEEE Signal Processing Letters, 2012 - ieeexplore.ieee.org
In this letter, we derive a minimum mean squared error (MMSE) optimal estimator for clean
speech spectral amplitudes, which we apply in single channel speech enhancement. As …

Spectral masking and filtering

T Gerkmann, E Vincent - Audio source separation and speech …, 2018 - Wiley Online Library
In this chapter, we consider spectral masking filters for interference reduction in case of a
single‐channel input. The considered techniques are thus relevant when only one …

Low-distortion MMSE speech enhancement estimator based on Laplacian prior

BM Mahmmod, SH Abdulhussian… - IEEE …, 2017 - ieeexplore.ieee.org
The most well-known conventional speech enhancement algorithms introduce unwanted
artifact noise and speech distortion to the enhanced signal. Reducing the effects of such …

Analysis of the decision-directed SNR estimator for speech enhancement with respect to low-SNR and transient conditions

C Breithaupt, R Martin - IEEE transactions on audio, speech …, 2010 - ieeexplore.ieee.org
Because of their many applications and their relative ease of implementation, single-
channel speech enhancement algorithms have received much attention. As a consequence …