Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Sixty years of frequency-domain monaural speech enhancement: From traditional to deep learning methods
Frequency-domain monaural speech enhancement has been extensively studied for over
60 years, and a great number of methods have been proposed and applied to many …
60 years, and a great number of methods have been proposed and applied to many …
An overview of image caption generation methods
H Wang, Y Zhang, X Yu - Computational intelligence and …, 2020 - Wiley Online Library
In recent years, with the rapid development of artificial intelligence, image caption has
gradually attracted the attention of many researchers in the field of artificial intelligence and …
gradually attracted the attention of many researchers in the field of artificial intelligence and …
Speech enhancement and dereverberation with diffusion-based generative models
In this work, we build upon our previous publication and use diffusion-based generative
models for speech enhancement. We present a detailed overview of the diffusion process …
models for speech enhancement. We present a detailed overview of the diffusion process …
Conditional diffusion probabilistic model for speech enhancement
Speech enhancement is a critical component of many user-oriented audio applications, yet
current systems still suffer from distorted and unnatural outputs. While generative models …
current systems still suffer from distorted and unnatural outputs. While generative models …
Metricgan+: An improved version of metricgan for speech enhancement
The discrepancy between the cost function used for training a speech enhancement model
and human auditory perception usually makes the quality of enhanced speech …
and human auditory perception usually makes the quality of enhanced speech …
TSTNN: Two-stage transformer based neural network for speech enhancement in the time domain
In this paper, we propose a transformer-based architecture, called two-stage transformer
neural network (TSTNN) for end-to-end speech denoising in the time domain. The proposed …
neural network (TSTNN) for end-to-end speech denoising in the time domain. The proposed …
Phasen: A phase-and-harmonics-aware speech enhancement network
Time-frequency (TF) domain masking is a mainstream approach for single-channel speech
enhancement. Recently, focuses have been put to phase prediction in addition to amplitude …
enhancement. Recently, focuses have been put to phase prediction in addition to amplitude …
SEGAN: Speech enhancement generative adversarial network
Current speech enhancement techniques operate on the spectral domain and/or exploit
some higher-level feature. The majority of them tackle a limited number of noise conditions …
some higher-level feature. The majority of them tackle a limited number of noise conditions …
Metricgan: Generative adversarial networks based black-box metric scores optimization for speech enhancement
Adversarial loss in a conditional generative adversarial network (GAN) is not designed to
directly optimize evaluation metrics of a target task, and thus, may not always guide the …
directly optimize evaluation metrics of a target task, and thus, may not always guide the …
StoRM: A diffusion-based stochastic regeneration model for speech enhancement and dereverberation
Diffusion models have shown a great ability at bridging the performance gap between
predictive and generative approaches for speech enhancement. We have shown that they …
predictive and generative approaches for speech enhancement. We have shown that they …