Causal Speech Enhancement with Predicting Semantics based on Quantized Self-supervised Learning Features

E Tsunoo, Y Saito, W Nakata, H Saruwatari - arXiv preprint arXiv …, 2024 - arxiv.org
Real-time speech enhancement (SE) is essential to online speech communication. Causal
SE models use only the previous context while predicting future information, such as …

MOSLight: A Lightweight Data-Efficient System for Non-Intrusive Speech Quality Assessment

Z Li, W Li - Proc. INTERSPEECH, 2023 - isca-archive.org
Automatically predicting the mean opinion score (MOS) of a synthesized speech without the
reference signal with deep learning systems has been studied extensively recently and …

Speaker adaptation using codebook integrated deep neural networks for speech enhancement

B Chidambar, D Naidu - JASA Express Letters, 2024 - pubs.aip.org
Deep neural network (DNN) based speech enhancement techniques have shown superior
performance compared to the traditional speech enhancement approaches in handling …

A DCDP-Net Speech Enhancement Model for Parallel Denoising of Amplitude and Phase Spectra

Z Feng, Z Guan, S Ou, Y Gao - 2024 5th International …, 2024 - ieeexplore.ieee.org
In this paper, we propose DCDP-Net for real-time speech enhancement in the
time-frequency domain, which denoises both the magnitude and phase spectra. DCDP-Net uses …

Detector-driven speech background noise removal with convolutional networks

C Ayar - 2022 - dspace.yasar.edu.tr
Speech background noise is a common issue, which has become especially important with
the increasing popularity of online meetings and live internet broadcasting. Recently, Deep …

Speech Enhancement with Dual-Stream Interaction Conformer

Q Zhao, Y Gao, P Song - Authorea Preprints, 2024 - authorea.com
Since noise significantly impacts speech quality and intelligibility, it is important to eliminate
it in speech enhancement. Conformer networks have gained popularity in this area due to …