Conventional and contemporary approaches used in text to speech synthesis: A review

N Kaur, P Singh - Artificial Intelligence Review, 2023 - Springer
Nowadays speech synthesis or text to speech (TTS), an ability of system to produce human
like natural sounding voice from the written text, is gaining popularity in the field of speech …

Statistical parametric speech synthesis

H Zen, K Tokuda, AW Black - Speech Communication, 2009 - Elsevier
This review gives a general overview of techniques used in statistical parametric speech
synthesis. One instance of these techniques, called hidden Markov model (HMM)-based …

Statistical parametric speech synthesis using deep neural networks

H Zen, A Senior, M Schuster - 2013 IEEE International …, 2013 - ieeexplore.ieee.org
Conventional approaches to statistical parametric speech synthesis typically use decision
tree-clustered context-dependent hidden Markov models (HMMs) to represent probability …

Speech synthesis with mixed emotions

K Zhou, B Sisman, R Rana… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Emotional speech synthesis aims to synthesize human voices with various emotional effects.
The current studies are mostly focused on imitating an averaged style belonging to a specific …

Unidirectional long short-term memory recurrent neural network with recurrent output layer for low-latency speech synthesis

H Zen, H Sak - … Conference on Acoustics, Speech and Signal …, 2015 - ieeexplore.ieee.org
Long short-term memory recurrent neural networks (LSTM-RNNs) have been applied to
various speech applications including acoustic modeling for statistical parametric speech …

Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory

T Toda, AW Black, K Tokuda - IEEE Transactions on Audio …, 2007 - ieeexplore.ieee.org
In this paper, we describe a novel spectral conversion method for voice conversion (VC). A
Gaussian mixture model (GMM) of the joint probability density of source and target features …

Deep learning for acoustic modeling in parametric speech generation: A systematic review of existing techniques and future trends

ZH Ling, SY Kang, H Zen, A Senior… - IEEE Signal …, 2015 - ieeexplore.ieee.org
Hidden Markov models (HMMs) and Gaussian mixture models (GMMs) are the two most
common types of acoustic models used in statistical parametric approaches for generating …

Synthetic speech detection through short-term and long-term prediction traces

C Borrelli, P Bestagini, F Antonacci, A Sarti… - EURASIP Journal on …, 2021 - Springer
Several methods for synthetic audio speech generation have been developed in the
literature through the years. With the great technological advances brought by deep …

Speech synthesis based on hidden Markov models

K Tokuda, Y Nankaku, T Toda, H Zen… - Proceedings of the …, 2013 - ieeexplore.ieee.org
This paper gives a general overview of hidden Markov model (HMM)-based speech
synthesis, which has recently been demonstrated to be very effective in synthesizing …

Method and system for non-parametric voice conversion

I Agiomyrgiannakis - US Patent 9,183,830, 2015 - Google Patents
A method and system is disclosed for non-parametric speech conversion. A text-to-speech
(TTS) synthesis system may …