Exploiting morphological and phonological features to improve prosodic phrasing for Mongolian speech synthesis

R Liu, B Sisman, F Bao, J Yang… - IEEE/ACM Transactions …, 2020 - ieeexplore.ieee.org
Prosodic phrasing is an important factor that affects naturalness and intelligibility in text-to-
speech synthesis. Studies show that deep learning techniques improve prosodic phrasing …

Using continuous lexical embeddings to improve symbolic-prosody prediction in a text-to-speech front-end

A Rendel, R Fernandez, R Hoory… - … on Acoustics, Speech …, 2016 - ieeexplore.ieee.org
The prediction of symbolic prosodic categories from text is an important, but challenging,
natural-language processing task given the various ways in which an input can be realized …

Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model.

R Liu, F Bao, G Gao, H Zhang, Y Wang - Interspeech, 2018 - ttslr.github.io
In the speech synthesis systems, the phrase break (PB) prediction is the first and most
important step. Recently, the state-of-the-art PB prediction systems mainly rely on word …

Improving Prosodic Boundaries Prediction for Mandarin Speech Synthesis by Using Enhanced Embedding Feature and Model Fusion Approach.

Y Zheng, Y Li, Z Wen, X Ding, J Tao - INTERSPEECH, 2016 - isca-archive.org
Hierarchical prosody structure generation is an important but challenging component for
speech synthesis systems. In this paper, we investigate the use of enhanced embedding …

An Investigation of Recurrent Neural Network Architectures Using Word Embeddings for Phrase Break Prediction.

A Vadapalli, SV Gangashetty - Interspeech, 2016 - academia.edu
This paper presents our investigations of recurrent neural networks (RNNs) for the phrase
break prediction task. With the advent of deep learning, there have been attempts to apply …

Data-driven pause prediction for synthesis of storytelling style speech based on discourse modes

P Sarkar, KS Rao - 2015 IEEE International Conference on …, 2015 - ieeexplore.ieee.org
In storytelling style, a storyteller generally uses prosodic variations with subtle speech
nuances for the better apprehension of the listeners. It is achieved by emphasizing …

A LSTM approach with sub-word embeddings for Mongolian phrase break prediction

R Liu, F Bao, G Gao, H Zhang… - Proceedings of the 27th …, 2018 - aclanthology.org
In this paper, we first utilize the word embedding that focuses on sub-word units to the
Mongolian Phrase Break (PB) prediction task by using Long-Short-Term-Memory (LSTM) …

An investigation of speaker independent phrase break models in End-to-End TTS systems

A Vadapalli - arXiv preprint arXiv:2304.04157, 2023 - arxiv.org
This paper presents our work on phrase break prediction in the context of end-to-end TTS
systems, motivated by the following questions: (i) Is there any utility in incorporating an …

An Investigation of Phrase Break Prediction in an End-to-End TTS System

A Vadapalli - SN Computer Science, 2025 - Springer
This work explores the use of external phrase break prediction models to enhance listener
comprehension in end-to-end text-to-speech (TTS) systems. The effectiveness of these …

Modeling pauses for synthesis of storytelling style speech using unsupervised word features

P Sarkar, KS Rao - Procedia Computer Science, 2015 - Elsevier
In the storytelling style speech pauses or phrase breaks play a significant role in introducing
suspense and climax in the story. More often pauses are provided by a storyteller to capture …