- Academic Search

Statistical parametric speech synthesis

H Zen, K Tokuda, AW Black - Speech Communication, 2009 - Elsevier

This review gives a general overview of techniques used in statistical parametric speech
synthesis. One instance of these techniques, called hidden Markov model (HMM)-based …‏

Cited by 1657 - All 22 versions

[PDF] researchgate.net

Statistical parametric speech synthesis using deep neural networks

H Zen, A Senior, M Schuster - 2013 IEEE International …, 2013 - ieeexplore.ieee.org

Conventional approaches to statistical parametric speech synthesis typically use decision
tree-clustered context-dependent hidden Markov models (HMMs) to represent probability …‏

Cited by 1186 - All 11 versions

[PDF] ieee.org

Statistical parametric speech synthesis incorporating generative adversarial networks

Y Saito, S Takamichi… - IEEE/ACM Transactions on …, 2017 - ieeexplore.ieee.org

A method for statistical parametric speech synthesis incorporating generative adversarial
networks (GANs) is proposed. Although powerful deep neural networks techniques can be …‏

Cited by 274 - All 9 versions

[PDF] audentia-gestion.fr

Unidirectional long short-term memory recurrent neural network with recurrent output layer for low-latency speech synthesis

H Zen, H Sak - … Conference on Acoustics, Speech and Signal …, 2015 - ieeexplore.ieee.org

Long short-term memory recurrent neural networks (LSTM-RNNs) have been applied to
various speech applications including acoustic modeling for statistical parametric speech …‏

Cited by 393 - All 10 versions

[PDF] google.com

Deep mixture density networks for acoustic modeling in statistical parametric speech synthesis

H Zen, A Senior - … Conference on Acoustics, Speech and Signal …, 2014 - ieeexplore.ieee.org

Statistical parametric speech synthesis (SPSS) using deep neural networks (DNNs) has
shown its potential to produce naturally-sounding synthesized speech. However, there are …‏

Cited by 285 - All 8 versions

[PDF] arxiv.org

PromptTTS++: Controlling speaker identity in prompt-based text-to-speech using natural language descriptions

R Shimizu, R Yamamoto, M Kawamura… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org

We propose PromptTTS++, a prompt-based text-to-speech (TTS) synthesis system that
allows control over speaker identity using natural language descriptions. To control speaker …‏

Cited by 22 - All 4 versions

[PDF] arxiv.org

Source-filter HiFi-GAN: Fast and pitch controllable high-fidelity neural vocoder

R Yoneyama, YC Wu, T Toda - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org

Our previous work, the unified source-filter GAN (uSFGAN) vocoder, introduced a novel
architecture based on the source-filter theory into the parallel waveform generative …‏

Cited by 34 - All 6 versions

[PDF] isca-archive.org

[PDF] Harvest: A High-Performance Fundamental Frequency Estimator from Speech Signals.

M Morise - INTERSPEECH, 2017 - isca-archive.org

A fundamental frequency (F0) estimator named Harvest is described. The unique points of
Harvest are that it can obtain a reliable F0 contour and reduce the error that the voiced …‏

Cited by 122 - All 3 versions

[PDF] isca-archive.org

[PDF] Singing Voice Synthesis Based on Deep Neural Networks.

M Nishimura, K Hashimoto, K Oura, Y Nankaku… - Interspeech, 2016 - isca-archive.org

Singing voice synthesis techniques have been proposed based on a hidden Markov model
(HMM). In these approaches, the spectrum, excitation, and duration of singing voices are …‏

Cited by 111 - All 3 versions

[PDF] anu.edu.au

A comparative study of different classifiers for detecting depression from spontaneous speech

S Alghowinem, R Goecke, M Wagner… - … on Acoustics, Speech …, 2013 - ieeexplore.ieee.org

Accurate detection of depression from spontaneous speech could lead to an objective
diagnostic aid to assist clinicians to better diagnose depression. Little thought has been …‏

Cited by 137 - All 11 versions

Continuous F0 modeling for HMM based statistical parametric speech synthesis

Statistical parametric speech synthesis‏