Sixty years of frequency-domain monaural speech enhancement: From traditional to deep learning methods

C Zheng, H Zhang, W Liu, X Luo, A Li, X Li… - Trends in …, 2023 - journals.sagepub.com
Frequency-domain monaural speech enhancement has been extensively studied for over
60 years, and a great number of methods have been proposed and applied to many …

Tea-pse 2.0: Sub-band network for real-time personalized speech enhancement

Y Ju, S Zhang, W Rao, Y Wang, T Yu… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org
Personalized speech enhancement (PSE) utilizes additional cues like speaker embeddings
to remove background noise and interfering speech and extract the speech from target …

[PDF][PDF] Spectro-Temporal SubNet for Real-Time Monaural Speech Denoising and Dereverberation.

F **ong, W Chen, P Wang, X Li, J Feng - Interspeech, 2022 - researchgate.net
This paper presents an improved subband neural network applied to joint speech denoising
and dereverberation for online single-channel scenarios. Preserving the advantages of …

Deep subband network for joint suppression of echo, noise and reverberation in real-time fullband speech communication

F **ong, M Dong, K Zhou, H Zhu… - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
This paper presents a deep and lightweight subband neural network which jointly
suppresses the common interference in real-time fullband speech communication: echo …

Estimation of ideal binary mask for audio-visual monaural speech enhancement

S Balasubramanian, R Rajavel, A Kar - Circuits, Systems, and Signal …, 2023 - Springer
The estimation of the Ideal Binary Mask (IBM) based on speech cochleagram and visual
cues were carried out in this paper to improve the speech intelligibility and quality using an …

Low-complexity filter for software-defined radio by modulated interpolated coefficient decimated filter in a hybrid Farrow

TO Otunniyi, HC Myburgh - Sensors, 2022 - mdpi.com
Realising a low-complexity Farrow channelisation algorithm for multi-standard receivers in
software-defined radio is a challenging task. A Farrow filter operates best at low frequencies …

Interference-Controlled Maximum Noise Reduction Beamformer Based on Deep-Learned Interference Manifold

Y Yang, N Pan, W Zhang, C Pan… - … /ACM Transactions on …, 2024 - ieeexplore.ieee.org
Beamforming has been used in a wide range of applications to extract the signal of interest
from microphone array observations, which consist of not only the signal of interest, but also …

[PDF][PDF] Investigation on the Band Importance of Phase-aware Speech Enhancement.

Z Zhang, DS Williamson, Y Shen - INTERSPEECH, 2022 - researchgate.net
Many existing phase-aware speech enhancement algorithms consider the phase at all
spectral frequencies to be equally important to perceptual quality and intelligibility. Although …

TS-CGANet: A Two-Stage Complex and Real Dual-Path Sub-Band Fusion Network for Full-Band Speech Enhancement

H Chen, X Zhang - Applied Sciences, 2023 - mdpi.com
Speech enhancement based on deep neural networks faces difficulties, as modeling more
frequency bands can lead to a decrease in the resolution of low-frequency bands and …

Band-Split Inter-SubNet: Band-Split with Subband Interaction for Monaural Speech Enhancement

YC Pan, YL Shen, YF Liao… - 2024 Asia Pacific Signal …, 2024 - ieeexplore.ieee.org
Speech enhancement models are developed to improve quality and intelligibility of speech
for numerous daily applications. With the rapid development of technology, the neural …