Deep neural network techniques for monaural speech enhancement and separation: state of the art analysis

P Ochieng - Artificial Intelligence Review, 2023 - Springer
Deep neural network (DNN) techniques have become pervasive in domains such as
natural language processing and computer vision. They have achieved great success in …

PoCoNet: Better speech enhancement with frequency-positional embeddings, semi-supervised conversational data, and biased loss

U Isik, R Giri, N Phansalkar, JM Valin… - arXiv preprint arXiv …, 2020 - arxiv.org
Neural network applications generally benefit from larger-sized models, but for current
speech enhancement models, larger scale networks often suffer from decreased robustness …

DeepFilterNet: A low complexity speech enhancement framework for full-band audio based on deep filtering

H Schröter, AN Escalante-B… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Complex-valued processing has brought deep learning-based speech enhancement and
signal extraction to a new level. Typically, the process is based on a time-frequency (TF) …

Time domain audio visual speech separation

J Wu, Y Xu, SX Zhang, LW Chen, M Yu… - 2019 IEEE automatic …, 2019 - ieeexplore.ieee.org
Audio-visual multi-modal modeling has been demonstrated to be effective in many speech
related tasks, such as speech recognition and speech enhancement. This paper introduces …

Differentiable consistency constraints for improved deep speech enhancement

S Wisdom, JR Hershey, K Wilson… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
In recent years, deep networks have led to dramatic improvements in speech enhancement
by framing it as a data-driven pattern recognition problem. In many modern enhancement …

Deep learning based phase reconstruction for speaker separation: A trigonometric perspective

ZQ Wang, K Tan, DL Wang - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org
This study investigates phase reconstruction for deep learning based monaural talker-
independent speaker separation in the short-time Fourier transform (STFT) domain. The key …

Two-step sound source separation: Training on learned latent targets

E Tzinis, S Venkataramani, Z Wang… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
In this paper, we propose a two-step training procedure for source separation via a deep
neural network. In the first step we learn a transform (and its inverse) to a latent space where …

End-to-end music source separation: Is it possible in the waveform domain?

F Lluís, J Pons, X Serra - arXiv preprint arXiv:1810.12187, 2018 - arxiv.org
Most of the currently successful source separation techniques use the magnitude
spectrogram as input, and are therefore by default omitting part of the signal: the phase. To …

End-to-end multi-channel speech separation

R Gu, J Wu, SX Zhang, L Chen, Y Xu, M Yu… - arXiv preprint arXiv …, 2019 - arxiv.org
The end-to-end approach for single-channel speech separation has been studied recently
and shown promising results. This paper extends the previous approach and proposes a …