Google Scholar

An overview of deep-learning-based audio-visual speech enhancement and separation

D Michelsanti, ZH Tan, SX Zhang, Y Xu… - … on Audio, Speech …, 2021 - ieeexplore.ieee.org

Speech enhancement and speech separation are two related tasks, whose purpose is to
extract either one or more target speech signals, respectively, from a mixture of sounds …

Cited by 306 · Related articles · All 6 versions

[PDF] arxiv.org

ADL-MVDR: All deep learning MVDR beamformer for target speech separation

Z Zhang, Y Xu, M Yu, SX Zhang… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org

Speech separation algorithms are often used to separate the target speech from other
interfering sources. However, purely neural network based speech separation systems often …

Cited by 150 · Related articles · All 7 versions

[PDF] arxiv.org

Towards unified all-neural beamforming for time and frequency domain speech separation

R Gu, SX Zhang, Y Zou, D Yu - IEEE/ACM Transactions on …, 2022 - ieeexplore.ieee.org

Recently, frequency domain all-neural beamforming methods have achieved remarkable
progress for multichannel speech separation. In parallel, the integration of time domain …

Cited by 29 · Related articles · All 5 versions

[PDF] neurips.cc

UNSSOR: Unsupervised neural speech separation by leveraging over-determined training mixtures

ZQ Wang, S Watanabe - Advances in Neural Information …, 2023 - proceedings.neurips.cc

In reverberant conditions with multiple concurrent speakers, each microphone acquires a
mixture signal of multiple speakers at a different location. In over-determined conditions …

Cited by 11 · Related articles · All 10 versions

[PDF] arxiv.org

Audio-visual end-to-end multi-channel speech separation, dereverberation and recognition

G Li, J Deng, M Geng, Z Jin, T Wang… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org

Accurate recognition of cocktail party speech containing overlapping speakers, noise and
reverberation remains a highly challenging task to date. Motivated by the invariance of …

Cited by 15 · Related articles · All 7 versions

[PDF] arxiv.org

Complex neural spatial filter: Enhancing multi-channel target speech separation in complex domain

R Gu, SX Zhang, Y Zou, D Yu - IEEE Signal Processing Letters, 2021 - ieeexplore.ieee.org

To date, mainstream target speech separation (TSS) approaches are formulated to estimate
the complex ratio mask (cRM) of target speech in time-frequency domain under supervised …

Cited by 42 · Related articles · All 5 versions

[PDF] arxiv.org

Generalized spatio-temporal RNN beamformer for target speech separation

Y Xu, Z Zhang, M Yu, SX Zhang, D Yu - arXiv preprint arXiv:2101.01280, 2021 - arxiv.org

Although the conventional mask-based minimum variance distortionless response (MVDR)
could reduce the non-linear distortion, the residual noise level of the MVDR separated …

Cited by 49 · Related articles · All 6 versions

X-TF-GridNet: A time–frequency domain target speaker extraction network with adaptive speaker embedding fusion

F Hao, X Li, C Zheng - Information Fusion, 2024 - Elsevier

Target speaker extraction (TSE) which has the capability to directly extract desired speech
given enrollment utterances of the target speaker has attracted more and more attention for …

Cited by 8 · Related articles · All 3 versions

[PDF] uni-paderborn.de

End-to-end dereverberation, beamforming, and speech recognition in a cocktail party

W Zhang, X Chang, C Boeddeker… - … on Audio, Speech …, 2022 - ieeexplore.ieee.org

Far-field multi-speaker automatic speech recognition (ASR) has drawn increasing attention
in recent years. Most existing methods feature a signal processing frontend and an ASR …

Cited by 21 · Related articles · All 6 versions

[PDF] arxiv.org

Multi-channel multi-frame ADL-MVDR for target speech separation

Z Zhang, Y Xu, M Yu, SX Zhang, L Chen… - … on Audio, Speech …, 2021 - ieeexplore.ieee.org

Many purely neural network based speech separation approaches have been proposed to
improve objective assessment scores, but they often introduce nonlinear distortions that are …

Cited by 33 · Related articles · All 6 versions

Neural spatio-temporal beamformer for target speech separation

An overview of deep-learning-based audio-visual speech enhancement and separation
