Exploiting morphological and phonological features to improve prosodic phrasing for mongolian speech synthesis
Prosodic phrasing is an important factor that affects naturalness and intelligibility in text-to-
speech synthesis. Studies show that deep learning techniques improve prosodic phrasing …
speech synthesis. Studies show that deep learning techniques improve prosodic phrasing …
Using continuous lexical embeddings to improve symbolic-prosody prediction in a text-to-speech front-end
The prediction of symbolic prosodic categories from text is an important, but challenging,
natural-language processing task given the various ways in which an input can be realized …
natural-language processing task given the various ways in which an input can be realized …
[PDF][PDF] Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model.
In the speech synthesis systems, the phrase break (PB) prediction is the first and most
important step. Recently, the state-of-the-art PB prediction systems mainly rely on word …
important step. Recently, the state-of-the-art PB prediction systems mainly rely on word …
[PDF][PDF] Improving Prosodic Boundaries Prediction for Mandarin Speech Synthesis by Using Enhanced Embedding Feature and Model Fusion Approach.
Hierarchical prosody structure generation is an important but challenging component for
speech synthesis systems. In this paper, we investigate the use of enhanced embedding …
speech synthesis systems. In this paper, we investigate the use of enhanced embedding …
[PDF][PDF] An Investigation of Recurrent Neural Network Architectures Using Word Embeddings for Phrase Break Prediction.
This paper presents our investigations of recurrent neural networks (RNNs) for the phrase
break prediction task. With the advent of deep learning, there have been attempts to apply …
break prediction task. With the advent of deep learning, there have been attempts to apply …
Data-driven pause prediction for synthesis of storytelling style speech based on discourse modes
In storytelling style, a storyteller generally uses prosodic variations with subtle speech
nuances for the better apprehension of the listeners. It is achieved by emphasizing …
nuances for the better apprehension of the listeners. It is achieved by emphasizing …
A lstm approach with sub-word embeddings for mongolian phrase break prediction
In this paper, we first utilize the word embedding that focuses on sub-word units to the
Mongolian Phrase Break (PB) prediction task by using Long-Short-Term-Memory (LSTM) …
Mongolian Phrase Break (PB) prediction task by using Long-Short-Term-Memory (LSTM) …
An investigation of speaker independent phrase break models in End-to-End TTS systems
A Vadapalli - arxiv preprint arxiv:2304.04157, 2023 - arxiv.org
This paper presents our work on phrase break prediction in the context of end-to-end TTS
systems, motivated by the following questions:(i) Is there any utility in incorporating an …
systems, motivated by the following questions:(i) Is there any utility in incorporating an …
An Investigation of Phrase Break Prediction in an End-to-End TTS System
A Vadapalli - SN Computer Science, 2025 - Springer
This work explores the use of external phrase break prediction models to enhance listener
comprehension in end-to-end text-to-speech (TTS) systems. The effectiveness of these …
comprehension in end-to-end text-to-speech (TTS) systems. The effectiveness of these …
Modeling pauses for synthesis of storytelling style speech using unsupervised word features
In the storytelling style speech pauses or phrase breaks play a significant role in introducing
suspense and climax in the story. More often pauses are provided by a storyteller to capture …
suspense and climax in the story. More often pauses are provided by a storyteller to capture …