Google Академія

DL Wang, J Chen - IEEE/ACM transactions on audio, speech …, 2018 - ieeexplore.ieee.org

Speech separation is the task of separating target speech from background interference.
Traditionally, speech separation is studied as a signal processing problem. A more recent …

Зберегти Послатися Цитовано в 1647 джерелах Пов’язані статті Кількість версій: 14

[Free GPT-4]
[DeepSeek]

[PDF] sagepub.com Full View

Sixty years of frequency-domain monaural speech enhancement: From traditional to deep learning methods

C Zheng, H Zhang, W Liu, X Luo, A Li, X Li… - Trends in …, 2023 - journals.sagepub.com

Frequency-domain monaural speech enhancement has been extensively studied for over
60 years, and a great number of methods have been proposed and applied to many …

Зберегти Послатися Цитовано в 46 джерелах Пов’язані статті Кількість версій: 9

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

DCCRN: Deep complex convolution recurrent network for phase-aware speech enhancement

Y Hu, Y Liu, S Lv, M **ng, S Zhang, Y Fu, J Wu… - arxiv preprint arxiv …, 2020 - arxiv.org

Speech enhancement has benefited from the success of deep learning in terms of
intelligibility and perceptual quality. Conventional time-frequency (TF) domain methods …

Зберегти Послатися Цитовано в 746 джерелах Пов’язані статті Кількість версій: 11 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Real time speech enhancement in the waveform domain

A Defossez, G Synnaeve, Y Adi - arxiv preprint arxiv:2006.12847, 2020 - arxiv.org

We present a causal speech enhancement model working on the raw waveform that runs in
real-time on a laptop CPU. The proposed model is based on an encoder-decoder …

Зберегти Послатися Цитовано в 575 джерелах Пов’язані статті Кількість версій: 7 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Metricgan+: An improved version of metricgan for speech enhancement

SW Fu, C Yu, TA Hsieh, P Plantinga… - arxiv preprint arxiv …, 2021 - arxiv.org

The discrepancy between the cost function used for training a speech enhancement model
and human auditory perception usually makes the quality of enhanced speech …

Зберегти Послатися Цитовано в 247 джерелах Пов’язані статті Кількість версій: 8 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

SDR–half-baked or well done?

J Le Roux, S Wisdom, H Erdogan… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org

In speech enhancement and source separation, signal-to-noise ratio is a ubiquitous
objective measure of denoising/separation quality. A decade ago, the BSS_eval toolkit was …

Зберегти Послатися Цитовано в 1386 джерелах Пов’язані статті Кількість версій: 10

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Deep learning for audio signal processing

H Purwins, B Li, T Virtanen, J Schlüter… - IEEE Journal of …, 2019 - ieeexplore.ieee.org

Given the recent surge in developments of deep learning, this paper provides a review of the
state-of-the-art deep learning techniques for audio signal processing. Speech, music, and …

Зберегти Послатися Цитовано в 918 джерелах Пов’язані статті Кількість версій: 7

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Looking to listen at the cocktail party: A speaker-independent audio-visual model for speech separation

A Ephrat, I Mosseri, O Lang, T Dekel, K Wilson… - arxiv preprint arxiv …, 2018 - arxiv.org

We present a joint audio-visual model for isolating a single speech signal from a mixture of
sounds such as other speakers and background noise. Solving this task using only audio as …

Зберегти Послатися Цитовано в 953 джерелах Пов’язані статті Кількість версій: 6 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] aaai.org

Phasen: A phase-and-harmonics-aware speech enhancement network

D Yin, C Luo, Z **ong, W Zeng - Proceedings of the AAAI conference on …, 2020 - ojs.aaai.org

Time-frequency (TF) domain masking is a mainstream approach for single-channel speech
enhancement. Recently, focuses have been put to phase prediction in addition to amplitude …

Зберегти Послатися Цитовано в 350 джерелах Пов’язані статті Кількість версій: 10 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

SEGAN: Speech enhancement generative adversarial network

S Pascual, A Bonafonte, J Serra - arxiv preprint arxiv:1703.09452, 2017 - arxiv.org

Current speech enhancement techniques operate on the spectral domain and/or exploit
some higher-level feature. The majority of them tackle a limited number of noise conditions …

Зберегти Послатися Цитовано в 1544 джерелах Пов’язані статті Кількість версій: 4 Показати у форматі HTML

Створити сповіщення

Послатися

Розширений пошук

Збережено в моїй бібліотеці

Speech enhancement with LSTM recurrent neural networks and its application to noise-robust ASR

Supervised speech separation based on deep learning: An overview

Sixty years of frequency-domain monaural speech enhancement: From traditional to deep learning methods

DCCRN: Deep complex convolution recurrent network for phase-aware speech enhancement

Real time speech enhancement in the waveform domain

Metricgan+: An improved version of metricgan for speech enhancement

SDR–half-baked or well done?

Deep learning for audio signal processing

Looking to listen at the cocktail party: A speaker-independent audio-visual model for speech separation

Phasen: A phase-and-harmonics-aware speech enhancement network

SEGAN: Speech enhancement generative adversarial network