HiFi-GAN: High-fidelity denoising and dereverberation based on speech deep features in adversarial networks

J Su, Z Jin, A Finkelstein - arXiv preprint …

Filtered noise shaping for time domain room impulse response estimation from reverberant speech

CJ Steinmetz, VK Ithapu… - 2021 IEEE Workshop on …, 2021 - ieeexplore.ieee.org
Deep learning approaches have emerged that aim to transform an audio signal so that it
sounds as if it was recorded in the same room as a reference recording, with applications …

Learning audio-visual dereverberation

C Chen, W Sun, D Harwath… - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
Reverberation not only degrades the quality of speech for human perception, but also
severely impacts the accuracy of automatic speech recognition. Prior work attempts to …

Polyphonic training set synthesis improves self-supervised urban sound classification

F Gontier, V Lostanlen, M Lagrange, N Fortin… - The Journal of the …, 2021 - pubs.aip.org
Machine listening systems for environmental acoustic monitoring face a shortage of expert
annotations to be used as training data. To circumvent this issue, the emerging paradigm of …

Yet another generative model for room impulse response estimation

S Lee, HS Choi, K Lee - … of Signal Processing to Audio and …, 2023 - ieeexplore.ieee.org
Recent neural room impulse response (RIR) estimators typically comprise an encoder for
reference audio analysis and a generator for RIR synthesis. Especially, it is the performance …

TS-RIR: Translated synthetic room impulse responses for speech augmentation

A Ratnarajah, Z Tang… - 2021 IEEE automatic …, 2021 - ieeexplore.ieee.org
We present a method for improving the quality of synthetic room impulse responses for
far-field speech recognition. We bridge the gap between the fidelity of synthetic room impulse …

Mutual learning for acoustic matching and dereverberation via visual scene-driven diffusion

J Ma, W Wang, Y Yang, F Zheng - European Conference on Computer …, 2024 - Springer
Visual acoustic matching (VAM) is pivotal for enhancing the immersive experience, and the
task of dereverberation is effective in improving audio intelligibility. Existing methods treat …