- Academic Search

Y Qian, C Weng, X Chang, S Wang, D Yu - Frontiers of Information …, 2018 - Springer

The cocktail party problem, i.e., tracing and recognizing the speech of a specific speaker
when multiple speakers talk simultaneously, is one of the critical problems yet to be solved …

Cited by: 105; all 6 versions

Recent developments in speech enhancement in the short-time Fourier transform domain

M Parchami, WP Zhu, B Champagne… - IEEE Circuits and …, 2016 - ieeexplore.ieee.org

In this paper, we present an overview on the topic of noise reduction in the short-time Fourier
transform (STFT) domain. First, we briefly review the conventional literature in the single- and …

Cited by: 99

[PDF] hal.science

A consolidated perspective on multimicrophone speech enhancement and source separation

S Gannot, E Vincent… - … /ACM Transactions on …, 2017 - ieeexplore.ieee.org

Speech enhancement and separation are core problems in audio signal processing, with
commercial applications in devices as diverse as mobile phones, conference call systems …

Cited by: 646; all 12 versions

[PDF] isca-archive.org

[PDF] Improved MVDR beamforming using single-channel mask prediction networks.

H Erdogan, JR Hershey, S Watanabe, MI Mandel… - Interspeech, 2016 - isca-archive.org

Recent studies on multi-microphone speech databases indicate that it is beneficial to
perform beamforming to improve speech recognition accuracies, especially when there is a …

Cited by: 384; all 14 versions

[PDF] merl.com

Multi-channel deep clustering: Discriminative spectral and spatial embeddings for speaker-independent speech separation

ZQ Wang, J Le Roux, JR Hershey - 2018 IEEE International …, 2018 - ieeexplore.ieee.org

The recently-proposed deep clustering algorithm represents a fundamental advance
towards solving the cocktail party problem in the single-channel case. When multiple …

Cited by: 281; all 7 versions

[PDF] arxiv.org

Far-field automatic speech recognition

R Haeb-Umbach, J Heymann, L Drude… - Proceedings of the …, 2020 - ieeexplore.ieee.org

The machine recognition of speech spoken at a distance from the microphones, known as
far-field automatic speech recognition (ASR), has received a significant increase in attention …

Cited by: 121; all 8 versions

Robust MVDR beamforming using time-frequency masks for online/offline ASR in noise

T Higuchi, N Ito, T Yoshioka… - 2016 IEEE International …, 2016 - ieeexplore.ieee.org

This paper considers acoustic beamforming for noise robust automatic speech recognition
(ASR). A beamformer attenuates background noise by enhancing sound components …

Cited by: 270; all 3 versions

[PDF] arxiv.org

Deep learning based multi-source localization with source splitting and its effectiveness in multi-talker speech recognition

AS Subramanian, C Weng, S Watanabe, M Yu… - Computer Speech & …, 2022 - Elsevier

Multi-source localization is an important and challenging technique for multi-talker
conversation analysis. This paper proposes a novel supervised learning method using deep …

Cited by: 82; all 5 versions

[PDF] arxiv.org

ESPnet-SE: End-to-end speech enhancement and separation toolkit designed for ASR integration

C Li, J Shi, W Zhang, AS Subramanian… - 2021 IEEE Spoken …, 2021 - ieeexplore.ieee.org

We present ESPnet-SE, which is designed for the quick development of speech
enhancement and speech separation systems in a single framework, along with the optional …

Cited by: 95; all 5 versions

[PDF] isca-archive.org

[PDF] Front-end processing for the CHiME-5 dinner party scenario

C Boeddeker, J Heitkaemper… - CHiME5 Workshop …, 2018 - isca-archive.org

This contribution presents a speech enhancement system for the CHiME-5 Dinner Party
Scenario. The front-end employs multi-channel linear time-variant filtering and achieves its …

Cited by: 142; all 9 versions

On optimal frequency-domain multichannel linear filtering for noise reduction

Past review, current progress, and challenges ahead on the cocktail party problem