Sixty years of frequency-domain monaural speech enhancement: From traditional to deep learning methods

C Zheng, H Zhang, W Liu, X Luo, A Li, X Li… - Trends in …, 2023 - journals.sagepub.com
Frequency-domain monaural speech enhancement has been extensively studied for over
60 years, and a great number of methods have been proposed and applied to many …

Vocos: Closing the gap between time-domain and fourier-based neural vocoders for high-quality audio synthesis

H Siuzdak - ar** losses for speech generation tasks
Y Ai, ZH Ling - IEEE/ACM Transactions on Audio, Speech, and …, 2024 - ieeexplore.ieee.org
This paper presents a novel neural speech phase prediction model which predicts wrapped
phase spectra directly from amplitude spectra. The proposed model is a cascade of a …