HiFi-GAN: High-fidelity denoising and dereverberation based on speech deep features in adversarial networks
J Su, Z **, A Finkelstein - ar** for time domain room impulse response estimation from reverberant speech
Deep learning approaches have emerged that aim to transform an audio signal so that it
sounds as if it was recorded in the same room as a reference recording, with applications …
sounds as if it was recorded in the same room as a reference recording, with applications …
Learning audio-visual dereverberation
Reverberation not only degrades the quality of speech for human perception, but also
severely impacts the accuracy of automatic speech recognition. Prior work attempts to …
severely impacts the accuracy of automatic speech recognition. Prior work attempts to …
Polyphonic training set synthesis improves self-supervised urban sound classification
Machine listening systems for environmental acoustic monitoring face a shortage of expert
annotations to be used as training data. To circumvent this issue, the emerging paradigm of …
annotations to be used as training data. To circumvent this issue, the emerging paradigm of …
Yet another generative model for room impulse response estimation
Recent neural room impulse response (RIR) estimators typically comprise an encoder for
reference audio analysis and a generator for RIR synthesis. Especially, it is the performance …
reference audio analysis and a generator for RIR synthesis. Especially, it is the performance …
Ts-rir: Translated synthetic room impulse responses for speech augmentation
We present a method for improving the quality of synthetic room impulse responses for far-
field speech recognition. We bridge the gap between the fidelity of synthetic room impulse …
field speech recognition. We bridge the gap between the fidelity of synthetic room impulse …
Mutual learning for acoustic matching and dereverberation via visual scene-driven diffusion
Visual acoustic matching (VAM) is pivotal for enhancing the immersive experience, and the
task of dereverberation is effective in improving audio intelligibility. Existing methods treat …
task of dereverberation is effective in improving audio intelligibility. Existing methods treat …