Sixty years of frequency-domain monaural speech enhancement: From traditional to deep learning methods

C Zheng, H Zhang, W Liu, X Luo, A Li, X Li… - Trends in …, 2023 - journals.sagepub.com
Frequency-domain monaural speech enhancement has been extensively studied for over
60 years, and a great number of methods have been proposed and applied to many …

A consolidated perspective on multimicrophone speech enhancement and source separation

S Gannot, E Vincent… - … /ACM Transactions on …, 2017 - ieeexplore.ieee.org
Speech enhancement and separation are core problems in audio signal processing, with
commercial applications in devices as diverse as mobile phones, conference call systems …

Two heads are better than one: A two-stage complex spectral map** approach for monaural speech enhancement

A Li, W Liu, C Zheng, C Fan, X Li - IEEE/ACM Transactions on …, 2021 - ieeexplore.ieee.org
For challenging acoustic scenarios as low signal-to-noise ratios, current speech
enhancement systems usually suffer from performance bottleneck in extracting the target …

Glance and gaze: A collaborative learning framework for single-channel speech enhancement

A Li, C Zheng, L Zhang, X Li - Applied Acoustics, 2022 - Elsevier
The capability of the human to pay attention to both coarse and fine-grained regions has
been applied to computer vision tasks. Motivated by that, we propose a collaborative …

Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation

A Ozerov, C Févotte - IEEE transactions on audio, speech, and …, 2009 - ieeexplore.ieee.org
We consider inference in a general data-driven object-based model of multichannel audio
data, assumed generated as a possibly underdetermined convolutive mixture of source …

The 2018 signal separation evaluation campaign

FR Stöter, A Liutkus, N Ito - … Variable Analysis and Signal Separation: 14th …, 2018 - Springer
This paper reports the organization and results for the 2018 community-based Signal
Separation Evaluation Campaign (SiSEC 2018). This year's edition was focused on audio …

Under-determined reverberant audio source separation using a full-rank spatial covariance model

NQK Duong, E Vincent… - IEEE Transactions on …, 2010 - ieeexplore.ieee.org
This paper addresses the modeling of reverberant recording environments in the context of
under-determined convolutive blind source separation. We model the contribution of each …

Music demixing challenge 2021

Y Mitsufuji, G Fabbro, S Uhlich, FR Stöter… - Frontiers in Signal …, 2022 - frontiersin.org
Music source separation has been intensively studied in the last decade and tremendous
progress with the advent of deep learning could be observed. Evaluation campaigns such …

Multichannel extensions of non-negative matrix factorization with complex-valued data

H Sawada, H Kameoka, S Araki… - IEEE Transactions on …, 2013 - ieeexplore.ieee.org
This paper presents new formulations and algorithms for multichannel extensions of non-
negative matrix factorization (NMF). The formulations employ Hermitian positive semidefinite …

Subjective and objective quality assessment of audio source separation

V Emiya, E Vincent, N Harlander… - IEEE Transactions on …, 2011 - ieeexplore.ieee.org
We aim to assess the perceived quality of estimated source signals in the context of audio
source separation. These signals may involve one or more kinds of distortions, including …