Bigvgan: A universal neural vocoder with large-scale training
Despite recent progress in generative adversarial network (GAN)-based vocoders, where
the model generates raw waveform conditioned on acoustic features, it is challenging to …
the model generates raw waveform conditioned on acoustic features, it is challenging to …
Real time speech enhancement in the waveform domain
We present a causal speech enhancement model working on the raw waveform that runs in
real-time on a laptop CPU. The proposed model is based on an encoder-decoder …
real-time on a laptop CPU. The proposed model is based on an encoder-decoder …
The interspeech 2020 deep noise suppression challenge: Datasets, subjective testing framework, and challenge results
The INTERSPEECH 2020 Deep Noise Suppression (DNS) Challenge is intended to
promote collaborative research in real-time single-channel Speech Enhancement aimed to …
promote collaborative research in real-time single-channel Speech Enhancement aimed to …
DNSMOS: A non-intrusive perceptual objective speech quality metric to evaluate noise suppressors
Human subjective evaluation is the" gold standard" to evaluate speech quality optimized for
human perception. Perceptual objective metrics serve as a proxy for subjective scores. The …
human perception. Perceptual objective metrics serve as a proxy for subjective scores. The …
DNSMOS P. 835: A non-intrusive perceptual objective speech quality metric to evaluate noise suppressors
Human subjective evaluation is the" gold standard" to evaluate speech quality optimized for
human perception. Perceptual objective metrics serve as a proxy for subjective scores. We …
human perception. Perceptual objective metrics serve as a proxy for subjective scores. We …
ICASSP 2021 deep noise suppression challenge
The Deep Noise Suppression (DNS) challenge is designed to foster innovation in the area
of noise suppression to achieve superior perceptual speech quality. We recently organized …
of noise suppression to achieve superior perceptual speech quality. We recently organized …
Interspeech 2021 deep noise suppression challenge
The Deep Noise Suppression (DNS) challenge is designed to foster innovation in the area
of noise suppression to achieve superior perceptual speech quality. We recently organized …
of noise suppression to achieve superior perceptual speech quality. We recently organized …
Weighted speech distortion losses for neural-network-based real-time speech enhancement
This paper investigates several aspects of training a RNN (recurrent neural network) that
impact the objective and subjective quality of enhanced speech for real-time single-channel …
impact the objective and subjective quality of enhanced speech for real-time single-channel …
A survey of audio enhancement algorithms for music, speech, bioacoustics, biomedical, industrial and environmental sounds by image U-Net
S Gul, MS Khan - IEEE Access, 2023 - ieeexplore.ieee.org
The recent surge in the use of Deep Neural Networks (DNNs) has also made its mark in the
field of Audio Enhancement (AE), providing much better quality than the classical methods …
field of Audio Enhancement (AE), providing much better quality than the classical methods …
Semi-supervised spoken language understanding via self-supervised speech and language model pretraining
Much recent work on Spoken Language Understanding (SLU) is limited in at least one of
three ways: models were trained on oracle text input and neglected ASR errors, models …
three ways: models were trained on oracle text input and neglected ASR errors, models …