Conventional and contemporary approaches used in text to speech synthesis: A review

N Kaur, P Singh - Artificial Intelligence Review, 2023 - Springer
Nowadays speech synthesis or text to speech (TTS), an ability of system to produce human
like natural sounding voice from the written text, is gaining popularity in the field of speech …

Statistical parametric speech synthesis

H Zen, K Tokuda, AW Black - speech communication, 2009 - Elsevier
This review gives a general overview of techniques used in statistical parametric speech
synthesis. One instance of these techniques, called hidden Markov model (HMM)-based …

Statistical parametric speech synthesis using deep neural networks

H Zen, A Senior, M Schuster - 2013 ieee international …, 2013 - ieeexplore.ieee.org
Conventional approaches to statistical parametric speech synthesis typically use decision
tree-clustered context-dependent hidden Markov models (HMMs) to represent probability …

Deep learning for acoustic modeling in parametric speech generation: A systematic review of existing techniques and future trends

ZH Ling, SY Kang, H Zen, A Senior… - IEEE Signal …, 2015 - ieeexplore.ieee.org
Hidden Markov models (HMMs) and Gaussian mixture models (GMMs) are the two most
common types of acoustic models used in statistical parametric approaches for generating …

Speech synthesis based on hidden Markov models

K Tokuda, Y Nankaku, T Toda, H Zen… - Proceedings of the …, 2013 - ieeexplore.ieee.org
This paper gives a general overview of hidden Markov model (HMM)-based speech
synthesis, which has recently been demonstrated to be very effective in synthesizing …

[PDF][PDF] The HMM-based speech synthesis system (HTS) version 2.0.

H Zen, T Nose, J Yamagishi, S Sako, T Masuko… - SSW, 2007 - cs.cmu.edu
A statistical parametric speech synthesis system based on hidden Markov models (HMMs)
has grown in popularity over the last few years. This system simultaneouslymodels …

Deep mixture density networks for acoustic modeling in statistical parametric speech synthesis

H Zen, A Senior - … conference on acoustics, speech and signal …, 2014 - ieeexplore.ieee.org
Statistical parametric speech synthesis (SPSS) using deep neural networks (DNNs) has
shown its potential to produce naturally-sounding synthesized speech. However, there are …

Evaluation of speaker verification security and detection of HMM-based synthetic speech

PL De Leon, M Pucher, J Yamagishi… - … on Audio, Speech …, 2012 - ieeexplore.ieee.org
In this paper, we evaluate the vulnerability of speaker verification (SV) systems to synthetic
speech. The SV systems are based on either the Gaussian mixture model–universal …

Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm

J Yamagishi, T Kobayashi, Y Nakano… - … on Audio, Speech …, 2009 - ieeexplore.ieee.org
In this paper, we analyze the effects of several factors and configuration choices
encountered during training and model construction when we want to obtain better and …

HMM-based speech synthesis utilizing glottal inverse filtering

T Raitio, A Suni, J Yamagishi, H Pulakka… - IEEE transactions on …, 2010 - ieeexplore.ieee.org
This paper describes an hidden Markov model (HMM)-based speech synthesizer that
utilizes glottal inverse filtering for generating natural sounding synthetic speech. In the …