Google Академія

Improving GANs for speech enhancement

Turnitin 降AI改写早检测系统早降重系统 Turnitin-UK版万方检测-期刊版维普编辑部版 Grammarly检测 Paperpass检测 checkpass检测 PaperYY检测

Generative adversarial networks for speech processing: A review

A Wali, Z Alamgir, S Karim, A Fawaz, MB Ali… - Computer Speech & …, 2022 - Elsevier

Generative adversarial networks (GANs) have seen remarkable progress in recent years.
They are used as generative models for all kinds of data such as text, images, audio, music …

Зберегти Послатися Цитовано в 63 джерелах Пов’язані статті Кількість версій: 2

[Free GPT-4]
[DeepSeek]

[PDF] springer.com

Deep neural network techniques for monaural speech enhancement and separation: state of the art analysis

P Ochieng - Artificial Intelligence Review, 2023 - Springer

Deep neural networks (DNN) techniques have become pervasive in domains such as
natural language processing and computer vision. They have achieved great success in …

Зберегти Послатися Цитовано в 27 джерелах Пов’язані статті Кількість версій: 10

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Real time speech enhancement in the waveform domain

A Defossez, G Synnaeve, Y Adi - arxiv preprint arxiv:2006.12847, 2020 - arxiv.org

We present a causal speech enhancement model working on the raw waveform that runs in
real-time on a laptop CPU. The proposed model is based on an encoder-decoder …

Зберегти Послатися Цитовано в 576 джерелах Пов’язані статті Кількість версій: 7 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Conditional diffusion probabilistic model for speech enhancement

YJ Lu, ZQ Wang, S Watanabe… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org

Speech enhancement is a critical component of many user-oriented audio applications, yet
current systems still suffer from distorted and unnatural outputs. While generative models …

Зберегти Послатися Цитовано в 181 джерелах Пов’язані статті Кількість версій: 7

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Universal speech enhancement with score-based diffusion

J Serrà, S Pascual, J Pons, RO Araz… - arxiv preprint arxiv …, 2022 - arxiv.org

Removing background noise from speech audio has been the subject of considerable effort,
especially in recent years due to the rise of virtual communication and amateur recordings …

Зберегти Послатися Цитовано в 95 джерелах Пов’язані статті Кількість версій: 6 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

HiFi-GAN: High-fidelity denoising and dereverberation based on speech deep features in adversarial networks

J Su, Z **, A Finkelstein - arxiv preprint arxiv:2006.05694, 2020 - arxiv.org

Real-world audio recordings are often degraded by factors such as noise, reverberation,
and equalization distortion. This paper introduces HiFi-GAN, a deep learning method to …

Зберегти Послатися Цитовано в 179 джерелах Пов’язані статті Кількість версій: 8 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Cmgan: Conformer-based metric-gan for monaural speech enhancement

S Abdulatif, R Cao, B Yang - IEEE/ACM Transactions on Audio …, 2024 - ieeexplore.ieee.org

In this work, we further develop the conformer-based metric generative adversarial network
(CMGAN) model 1 for speech enhancement (SE) in the time-frequency (TF) domain. This …

Зберегти Послатися Цитовано в 61 джерелах Пов’язані статті Кількість версій: 7

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Speech denoising in the waveform domain with self-attention

Z Kong, W **, A Dantrey… - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org

In this work, we present CleanUNet, a causal speech denoising model on the raw waveform.
The proposed model is based on an encoder-decoder architecture combined with several …

Зберегти Послатися Цитовано в 77 джерелах Пов’язані статті Кількість версій: 6

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

A time-frequency attention module for neural speech enhancement

Q Zhang, X Qian, Z Ni, A Nicolson… - … on Audio, Speech …, 2022 - ieeexplore.ieee.org

Speech enhancement plays an essential role in a wide range of speech processing
applications. Recent studies on speech enhancement tend to investigate how to effectively …

Зберегти Послатися Цитовано в 39 джерелах Пов’язані статті Кількість версій: 3

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A study on speech enhancement based on diffusion probabilistic model

YJ Lu, Y Tsao, S Watanabe - 2021 Asia-Pacific Signal and …, 2021 - ieeexplore.ieee.org

Diffusion probabilistic models have demonstrated an outstanding capability to model natural
images and raw audio waveforms through a paired diffusion and reverse processes. The …

Зберегти Послатися Цитовано в 72 джерелах Пов’язані статті Кількість версій: 8

Створити сповіщення

Послатися

Розширений пошук

Збережено в моїй бібліотеці

Improving GANs for speech enhancement

Generative adversarial networks for speech processing: A review

Deep neural network techniques for monaural speech enhancement and separation: state of the art analysis

Real time speech enhancement in the waveform domain

Conditional diffusion probabilistic model for speech enhancement

Universal speech enhancement with score-based diffusion

HiFi-GAN: High-fidelity denoising and dereverberation based on speech deep features in adversarial networks

Cmgan: Conformer-based metric-gan for monaural speech enhancement

Speech denoising in the waveform domain with self-attention

A time-frequency attention module for neural speech enhancement

A study on speech enhancement based on diffusion probabilistic model