Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Supervised speech separation based on deep learning: An overview
Speech separation is the task of separating target speech from background interference.
Traditionally, speech separation is studied as a signal processing problem. A more recent …
Traditionally, speech separation is studied as a signal processing problem. A more recent …
Sixty years of frequency-domain monaural speech enhancement: From traditional to deep learning methods
Frequency-domain monaural speech enhancement has been extensively studied for over
60 years, and a great number of methods have been proposed and applied to many …
60 years, and a great number of methods have been proposed and applied to many …
DCCRN: Deep complex convolution recurrent network for phase-aware speech enhancement
Speech enhancement has benefited from the success of deep learning in terms of
intelligibility and perceptual quality. Conventional time-frequency (TF) domain methods …
intelligibility and perceptual quality. Conventional time-frequency (TF) domain methods …
Real time speech enhancement in the waveform domain
We present a causal speech enhancement model working on the raw waveform that runs in
real-time on a laptop CPU. The proposed model is based on an encoder-decoder …
real-time on a laptop CPU. The proposed model is based on an encoder-decoder …
Metricgan+: An improved version of metricgan for speech enhancement
The discrepancy between the cost function used for training a speech enhancement model
and human auditory perception usually makes the quality of enhanced speech …
and human auditory perception usually makes the quality of enhanced speech …
SDR–half-baked or well done?
In speech enhancement and source separation, signal-to-noise ratio is a ubiquitous
objective measure of denoising/separation quality. A decade ago, the BSS_eval toolkit was …
objective measure of denoising/separation quality. A decade ago, the BSS_eval toolkit was …
Deep learning for audio signal processing
Given the recent surge in developments of deep learning, this paper provides a review of the
state-of-the-art deep learning techniques for audio signal processing. Speech, music, and …
state-of-the-art deep learning techniques for audio signal processing. Speech, music, and …
Looking to listen at the cocktail party: A speaker-independent audio-visual model for speech separation
We present a joint audio-visual model for isolating a single speech signal from a mixture of
sounds such as other speakers and background noise. Solving this task using only audio as …
sounds such as other speakers and background noise. Solving this task using only audio as …
Phasen: A phase-and-harmonics-aware speech enhancement network
Time-frequency (TF) domain masking is a mainstream approach for single-channel speech
enhancement. Recently, focuses have been put to phase prediction in addition to amplitude …
enhancement. Recently, focuses have been put to phase prediction in addition to amplitude …
SEGAN: Speech enhancement generative adversarial network
Current speech enhancement techniques operate on the spectral domain and/or exploit
some higher-level feature. The majority of them tackle a limited number of noise conditions …
some higher-level feature. The majority of them tackle a limited number of noise conditions …