Modeling durations of syllables using neural networks
KS Rao, B Yegnanarayana - Computer Speech & Language, 2007 - Elsevier
In this paper, we propose a neural network model for predicting the durations of syllables. A
four layer feedforward neural network trained with backpropagation algorithm is used for …
Role of neural network models for developing speech systems
KS Rao - Sadhana, 2011 - Springer
This paper discusses the application of neural networks for developing different speech
systems. Prosodic parameters of speech at syllable level depend on positional, contextual …
Segmental durations predicted with a neural network
JP Teixeira, DS Freitas - European Conference on Speech …, 2003 - bibliotecadigital.ipb.pt
This paper presents a segmental durations' model applied to the European Portuguese
language for TTS purposes. The model is based on a feed-forward neural network, trained …
Generation of suprasegmental information for speech using a recurrent neural network and binary gravitational search algorithm for feature selection
M Sheikhan - Applied intelligence, 2014 - Springer
Suprasegmental (prosody) features of discourse provide a vehicle by which speakers reflect
their mental purposes to listeners. Generating suitable prosody information is critical to …
Modeling supra-segmental features of syllables using neural networks
KS Rao - Speech, audio, image and biomedical signal …, 2008 - Springer
In this chapter we discuss modeling of supra-segmental features (intonation and duration) of
syllables, and suggest some applications of these models. These supra-segmental features …
Synthesizing suprasegmental speech information using hybrid of GA-ACO and dynamic neural network
M Sheikhan - The 5th Conference on Information and …, 2013 - ieeexplore.ieee.org
In generating natural speech by machines, removing the suprasegmental information (such
as stress, timing and pitch frequency) results in unpleasant speech. To provide this …
Analyzing Acoustic Markers of Emotion in Arabic Speech
MB Othman - 2019 - search.proquest.com
This study aims to obtain detailed acoustic knowledge of how speech is modulated when a
speaker's emotion changes from neutral to certain emotional states based on …
Use of phoneme dedicated artificial neural networks to predict segmental durations
JP Teixeira, DS Freitas - 10th International Conference on …, 2005 - bibliotecadigital.ipb.pt
The results of two alternative models to predict segmental durations in speech synthesis,
both based on Artificial Neural Networks (ANNs) are discussed. The ANN model consists in …
Learning continuous representation of text for phone duration modeling in statistical parametric speech synthesis
SK Rallabandi, SS Rallabandi, P Bandi… - … IEEE Workshop on …, 2015 - ieeexplore.ieee.org
In this paper, we investigate the usage of a continuous representation based approach of
the feature vector derived from input text to predict the phone durations in a Text to Speech …
Evaluation of a segmental durations model for tts
JP Teixeira, D Freitas - … Workshop on Computational Processing of the …, 2003 - Springer
In this paper we present a condensed description of a European Portuguese segmental
duration's model for TTS purposes and concentrate on its evaluation. This model is based …