Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Statistical parametric speech synthesis
This review gives a general overview of techniques used in statistical parametric speech
synthesis. One instance of these techniques, called hidden Markov model (HMM)-based …
synthesis. One instance of these techniques, called hidden Markov model (HMM)-based …
Statistical parametric speech synthesis using deep neural networks
Conventional approaches to statistical parametric speech synthesis typically use decision
tree-clustered context-dependent hidden Markov models (HMMs) to represent probability …
tree-clustered context-dependent hidden Markov models (HMMs) to represent probability …
Statistical parametric speech synthesis incorporating generative adversarial networks
A method for statistical parametric speech synthesis incorporating generative adversarial
networks (GANs) is proposed. Although powerful deep neural networks techniques can be …
networks (GANs) is proposed. Although powerful deep neural networks techniques can be …
Unidirectional long short-term memory recurrent neural network with recurrent output layer for low-latency speech synthesis
Long short-term memory recurrent neural networks (LSTM-RNNs) have been applied to
various speech applications including acoustic modeling for statistical parametric speech …
various speech applications including acoustic modeling for statistical parametric speech …
Deep mixture density networks for acoustic modeling in statistical parametric speech synthesis
Statistical parametric speech synthesis (SPSS) using deep neural networks (DNNs) has
shown its potential to produce naturally-sounding synthesized speech. However, there are …
shown its potential to produce naturally-sounding synthesized speech. However, there are …
Prompttts++: Controlling speaker identity in prompt-based text-to-speech using natural language descriptions
We propose PromptTTS++, a prompt-based text-to-speech (TTS) synthesis system that
allows control over speaker identity using natural language descriptions. To control speaker …
allows control over speaker identity using natural language descriptions. To control speaker …
Source-filter HiFi-GAN: Fast and pitch controllable high-fidelity neural vocoder
Our previous work, the unified source-filter GAN (uSFGAN) vocoder, introduced a novel
architecture based on the source-filter theory into the parallel waveform generative …
architecture based on the source-filter theory into the parallel waveform generative …
[PDF][PDF] Harvest: A High-Performance Fundamental Frequency Estimator from Speech Signals.
A fundamental frequency (F0) estimator named Harvest is described. The unique points of
Harvest are that it can obtain a reliable F0 contour and reduce the error that the voiced …
Harvest are that it can obtain a reliable F0 contour and reduce the error that the voiced …
[PDF][PDF] Singing Voice Synthesis Based on Deep Neural Networks.
Singing voice synthesis techniques have been proposed based on a hidden Markov model
(HMM). In these approaches, the spectrum, excitation, and duration of singing voices are …
(HMM). In these approaches, the spectrum, excitation, and duration of singing voices are …
A comparative study of different classifiers for detecting depression from spontaneous speech
Accurate detection of depression from spontaneous speech could lead to an objective
diagnostic aid to assist clinicians to better diagnose depression. Little thought has been …
diagnostic aid to assist clinicians to better diagnose depression. Little thought has been …