Google Scholar

ZQ Wang, P Wang, DL Wang - IEEE/ACM transactions on …, 2021 - ieeexplore.ieee.org

We propose multi-microphone complex spectral mapping, a simple way of applying deep
learning for time-varying non-linear beamforming, for speaker separation in reverberant …

Cited by 92. All 9 versions.

[PDF] arxiv.org

Towards unified all-neural beamforming for time and frequency domain speech separation

R Gu, SX Zhang, Y Zou, D Yu - IEEE/ACM Transactions on …, 2022 - ieeexplore.ieee.org

Recently, frequency domain all-neural beamforming methods have achieved remarkable
progress for multichannel speech separation. In parallel, the integration of time domain …

Cited by 29. All 5 versions.

[PDF] ieee.org

Multi-channel talker-independent speaker separation through location-based training

H Taherian, K Tan, DL Wang - IEEE/ACM Transactions on …, 2022 - ieeexplore.ieee.org

Permutation ambiguity is a crucial issue for deep learning based talker-independent
speaker separation. Deep clustering and permutation invariant training (PIT) have been …

Cited by 31. All 5 versions.

[PDF] arxiv.org

Audio-visual end-to-end multi-channel speech separation, dereverberation and recognition

G Li, J Deng, M Geng, Z Jin, T Wang… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org

Accurate recognition of cocktail party speech containing overlapping speakers, noise and
reverberation remains a highly challenging task to date. Motivated by the invariance of …

Cited by 15. All 7 versions.

[PDF] arxiv.org

Multi-channel speech separation using spatially selective deep non-linear filters

K Tesch, T Gerkmann - IEEE/ACM Transactions on Audio …, 2023 - ieeexplore.ieee.org

In a multi-channel separation task with multiple speakers, we aim to recover all individual
speech signals from the mixture. In contrast to single-channel approaches, which rely on the …

Cited by 18. All 4 versions.

[PDF] uni-paderborn.de

End-to-end dereverberation, beamforming, and speech recognition in a cocktail party

W Zhang, X Chang, C Boeddeker… - … on Audio, Speech …, 2022 - ieeexplore.ieee.org

Far-field multi-speaker automatic speech recognition (ASR) has drawn increasing attention
in recent years. Most existing methods feature a signal processing frontend and an ASR …

Cited by 21. All 6 versions.

[PDF] researchgate.net

A novel approach to multi-channel speech enhancement based on graph neural networks

HN Chau, TD Bui, HB Nguyen… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org

Multi-channel speech enhancement aims at utilizing spatial relationships between signals
captured from a microphone array along with temporal-spectral information efficiently to …

Cited by 6. All 3 versions.

[PDF] arxiv.org

Closing the gap between time-domain multi-channel speech enhancement on real and simulation conditions

W Zhang, J Shi, C Li, S Watanabe… - 2021 IEEE Workshop on …, 2021 - ieeexplore.ieee.org

The deep learning based time-domain models, e.g., Conv-TasNet, have shown great potential
in both single-channel and multi-channel speech enhancement. However, many …

Cited by 25. All 7 versions.

[PDF] arxiv.org

Implicit neural spatial filtering for multichannel source separation in the waveform domain

D Markovic, A Defossez, A Richard - arXiv preprint arXiv:2206.15423, 2022 - arxiv.org

We present a single-stage causal waveform-to-waveform multichannel model that can
separate moving sound sources based on their broad spatial locations in a dynamic …

Cited by 17. All 4 versions.

[PDF] arxiv.org

A time-domain real-valued generalized Wiener filter for multi-channel neural separation systems

Y Luo - IEEE/ACM Transactions on Audio, Speech, and …, 2022 - ieeexplore.ieee.org

Frequency-domain beamformers have been successful in a wide range of multi-channel
neural separation systems in the past years. However, the operations in conventional …

Cited by 20. All 4 versions.

Enhancing end-to-end multi-channel speech separation via spatial feature learning

Multi-microphone complex spectral mapping for utterance-wise and continuous speech separation