Statistical parametric speech synthesis

H Zen, K Tokuda, AW Black - speech communication, 2009 - Elsevier
This review gives a general overview of techniques used in statistical parametric speech
synthesis. One instance of these techniques, called hidden Markov model (HMM)-based …

Speech synthesis based on hidden Markov models

K Tokuda, Y Nankaku, T Toda, H Zen… - Proceedings of the …, 2013 - ieeexplore.ieee.org
This paper gives a general overview of hidden Markov model (HMM)-based speech
synthesis, which has recently been demonstrated to be very effective in synthesizing …

HMM-based speech synthesis utilizing glottal inverse filtering

T Raitio, A Suni, J Yamagishi, H Pulakka… - IEEE transactions on …, 2010 - ieeexplore.ieee.org
This paper describes an hidden Markov model (HMM)-based speech synthesizer that
utilizes glottal inverse filtering for generating natural sounding synthetic speech. In the …

Investigating fuzzy-input fuzzy-output support vector machines for robust voice quality classification

S Scherer, J Kane, C Gobl, F Schwenker - Computer Speech & Language, 2013 - Elsevier
The dynamic use of voice qualities in spoken language can reveal useful information on a
speakers attitude, mood and affective states. This information may be very desirable for a …

Learning interpretable control dimensions for speech synthesis by using external data

Z Hodari, O Watts, S Ronanki, S King - Interspeech 2018, 2018 - research.ed.ac.uk
There are many aspects of speech that we might want to control when creating text-to-
speech (TTS) systems. We present a general method that enables control of arbitrary …

Phase minimization for glottal model estimation

G Degottex, A Roebel, X Rodet - IEEE Transactions on Audio …, 2010 - ieeexplore.ieee.org
In glottal source analysis, the phase minimization criterion has already been proposed to
detect excitation instants. As shown in this paper, this criterion can also be used to estimate …

HMM-based speech synthesiser using the LF-model of the glottal source

JP Cabral, S Renals, J Yamagishi… - … on Acoustics, Speech …, 2011 - ieeexplore.ieee.org
A major factor which causes a deterioration in speech quality in HMM-based speech
synthesis is the use of a simple delta pulse signal to generate the excitation of voiced …

Synthesis and perception of breathy, normal, and lombard speech in the presence of noise

T Raitio, A Suni, M Vainio, P Alku - Computer Speech & Language, 2014 - Elsevier
This papers studies the synthesis of speech over a wide vocal effort continuum and its
perception in the presence of noise. Three types of speech are recorded and studied along …

The current state of Finnish NLP

M Hämäläinen, K Alnajjar - arxiv preprint arxiv:2109.11326, 2021 - arxiv.org
There are a lot of tools and resources available for processing Finnish. In this paper, we
survey recent papers focusing on Finnish NLP related to many different subcategories of …

High-pitched excitation generation for glottal vocoding in statistical parametric speech synthesis using a deep neural network

L Juvela, B Bollepalli, M Airaksinen… - 2016 IEEE International …, 2016 - ieeexplore.ieee.org
Achieving high quality and naturalness in statistical parametric synthesis of female voices
remains to be difficult despite recent advances in the study area. Vocoding is one such key …