Bigvgan: A universal neural vocoder with large-scale training

S Lee, W **, B Ginsburg, B Catanzaro… - arxiv preprint arxiv …, 2022 - arxiv.org
Despite recent progress in generative adversarial network (GAN)-based vocoders, where
the model generates raw waveform conditioned on acoustic features, it is challenging to …

Real time speech enhancement in the waveform domain

A Defossez, G Synnaeve, Y Adi - arxiv preprint arxiv:2006.12847, 2020 - arxiv.org
We present a causal speech enhancement model working on the raw waveform that runs in
real-time on a laptop CPU. The proposed model is based on an encoder-decoder …

The interspeech 2020 deep noise suppression challenge: Datasets, subjective testing framework, and challenge results

CKA Reddy, V Gopal, R Cutler, E Beyrami… - arxiv preprint arxiv …, 2020 - arxiv.org
The INTERSPEECH 2020 Deep Noise Suppression (DNS) Challenge is intended to
promote collaborative research in real-time single-channel Speech Enhancement aimed to …

DNSMOS: A non-intrusive perceptual objective speech quality metric to evaluate noise suppressors

CKA Reddy, V Gopal, R Cutler - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Human subjective evaluation is the" gold standard" to evaluate speech quality optimized for
human perception. Perceptual objective metrics serve as a proxy for subjective scores. The …

DNSMOS P. 835: A non-intrusive perceptual objective speech quality metric to evaluate noise suppressors

CKA Reddy, V Gopal, R Cutler - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
Human subjective evaluation is the" gold standard" to evaluate speech quality optimized for
human perception. Perceptual objective metrics serve as a proxy for subjective scores. We …

ICASSP 2021 deep noise suppression challenge

CKA Reddy, H Dubey, V Gopal, R Cutler… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
The Deep Noise Suppression (DNS) challenge is designed to foster innovation in the area
of noise suppression to achieve superior perceptual speech quality. We recently organized …

Interspeech 2021 deep noise suppression challenge

CKA Reddy, H Dubey, K Koishida, A Nair… - arxiv preprint arxiv …, 2021 - arxiv.org
The Deep Noise Suppression (DNS) challenge is designed to foster innovation in the area
of noise suppression to achieve superior perceptual speech quality. We recently organized …

Weighted speech distortion losses for neural-network-based real-time speech enhancement

Y **a, S Braun, CKA Reddy, H Dubey… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
This paper investigates several aspects of training a RNN (recurrent neural network) that
impact the objective and subjective quality of enhanced speech for real-time single-channel …

A survey of audio enhancement algorithms for music, speech, bioacoustics, biomedical, industrial and environmental sounds by image U-Net

S Gul, MS Khan - IEEE Access, 2023 - ieeexplore.ieee.org
The recent surge in the use of Deep Neural Networks (DNNs) has also made its mark in the
field of Audio Enhancement (AE), providing much better quality than the classical methods …

Semi-supervised spoken language understanding via self-supervised speech and language model pretraining

CI Lai, YS Chuang, HY Lee, SW Li… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Much recent work on Spoken Language Understanding (SLU) is limited in at least one of
three ways: models were trained on oracle text input and neglected ASR errors, models …