Generative adversarial networks for speech processing: A review
Generative adversarial networks (GANs) have seen remarkable progress in recent years.
They are used as generative models for all kinds of data such as text, images, audio, music …
Deep neural network techniques for monaural speech enhancement and separation: state of the art analysis
P Ochieng - Artificial Intelligence Review, 2023 - Springer
Deep neural networks (DNN) techniques have become pervasive in domains such as
natural language processing and computer vision. They have achieved great success in …
Real time speech enhancement in the waveform domain
We present a causal speech enhancement model working on the raw waveform that runs in
real-time on a laptop CPU. The proposed model is based on an encoder-decoder …
Conditional diffusion probabilistic model for speech enhancement
Speech enhancement is a critical component of many user-oriented audio applications, yet
current systems still suffer from distorted and unnatural outputs. While generative models …
Universal speech enhancement with score-based diffusion
Removing background noise from speech audio has been the subject of considerable effort,
especially in recent years due to the rise of virtual communication and amateur recordings …
HiFi-GAN: High-fidelity denoising and dereverberation based on speech deep features in adversarial networks
Real-world audio recordings are often degraded by factors such as noise, reverberation,
and equalization distortion. This paper introduces HiFi-GAN, a deep learning method to …
CMGAN: Conformer-based Metric-GAN for monaural speech enhancement
S Abdulatif, R Cao, B Yang - IEEE/ACM Transactions on Audio …, 2024 - ieeexplore.ieee.org
In this work, we further develop the conformer-based metric generative adversarial network
(CMGAN) model for speech enhancement (SE) in the time-frequency (TF) domain. This …
Speech denoising in the waveform domain with self-attention
In this work, we present CleanUNet, a causal speech denoising model on the raw waveform.
The proposed model is based on an encoder-decoder architecture combined with several …
A time-frequency attention module for neural speech enhancement
Speech enhancement plays an essential role in a wide range of speech processing
applications. Recent studies on speech enhancement tend to investigate how to effectively …
A study on speech enhancement based on diffusion probabilistic model
Diffusion probabilistic models have demonstrated an outstanding capability to model natural
images and raw audio waveforms through a paired diffusion and reverse processes. The …