An overview of voice conversion systems
Voice transformation (VT) aims to change one or more aspects of a speech signal while
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …
A review on human-computer interaction and intelligent robots
F Ren, Y Bao - International Journal of Information Technology & …, 2020 - World Scientific
In the field of artificial intelligence, human–computer interaction (HCI) technology and its
related intelligent robot technologies are essential and interesting contents of research …
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech
Automatic speaker verification (ASV) is one of the most natural and convenient means of
biometric person recognition. Unfortunately, just like all other biometric systems, ASV is …
Tacotron: Towards end-to-end speech synthesis
A text-to-speech synthesis system typically consists of multiple stages, such as a text
analysis frontend, an acoustic model and an audio synthesis module. Building these …
WaveNet: A generative model for raw audio
This paper introduces WaveNet, a deep neural network for generating raw audio waveforms.
The model is fully probabilistic and autoregressive, with the predictive distribution for each …
Deep voice 3: Scaling text-to-speech with convolutional sequence learning
We present Deep Voice 3, a fully-convolutional attention-based neural text-to-speech (TTS)
system. Deep Voice 3 matches state-of-the-art neural speech synthesis systems in …
WORLD: a vocoder-based high-quality speech synthesis system for real-time applications
A vocoder-based speech synthesis system, named WORLD, was developed in an effort to
improve the sound quality of real-time applications using speech. Speech analysis …
Tacotron: A fully end-to-end text-to-speech synthesis model
A text-to-speech synthesis system typically consists of multiple stages, such as a
text analysis frontend, an acoustic model and an audio synthesis module. Building these …
Unidirectional long short-term memory recurrent neural network with recurrent output layer for low-latency speech synthesis
Long short-term memory recurrent neural networks (LSTM-RNNs) have been applied to
various speech applications including acoustic modeling for statistical parametric speech …