A survey on speech synthesis techniques in Indian languages

SP Panda, AK Nayak, SC Rai - Multimedia Systems, 2020 - Springer
The text to speech technology has achieved significant progress during the past decade and
is an active area of research and development in providing different human–computer …

End-to-end code-switching tts with cross-lingual language model

X Zhou, X Tian, G Lee, RK Das… - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
Code-switching text-to-speech (TTS) aims to enable a system to speak two languages with a
single voice and in the same utterance. In this paper, we propose to incorporate cross …

Building a mixed-lingual neural TTS system with only monolingual data

L Xue, W Song, G Xu, L **e, Z Wu - arxiv preprint arxiv:1904.06063, 2019 - arxiv.org
When deploying a Chinese neural text-to-speech (TTS) synthesis system, one of the
challenges is to synthesize Chinese utterances with English phrases or words embedded …

Code-switched speech synthesis using bilingual phonetic posteriorgram with only monolingual corpora

Y Cao, S Liu, X Wu, S Kang, P Liu, Z Wu… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
Synthesizing fluent code-switched (CS) speech with consistent voice using only
monolingual corpora is still a challenging task, since language alternation seldom occurs …

An efficient model for text-to-speech synthesis in Indian languages

SP Panda, AK Nayak - International Journal of Speech Technology, 2015 - Springer
Speech Synthesis deals with artificial production of speech and a text-to-speech system
(TTS) in this aspect converts natural language text into a spoken waveform or speech. There …

Enhancing Multilingual Speech Generation and Recognition Abilities in LLMs with Constructed Code-switched Data

J Xu, D Tan, J Wang, X Chen - arxiv preprint arxiv:2409.10969, 2024 - arxiv.org
While large language models (LLMs) have been explored in the speech domain for both
generation and recognition tasks, their applications are predominantly confined to the …

[PDF][PDF] Cross-lingual voice conversion-based polyglot speech synthesizer for indian languages.

B Ramani, MPA Jeeva, P Vijayalakshmi… - …, 2014 - isca-archive.org
A polyglot speech synthesizer, synthesizes speech for any given monolingual or multilingual
text, in a single speaker's voice. In this regard, a polyglot speech corpus is required. It is …

[PDF][PDF] Dynamic Soft Windowing and Language Dependent Style Token for Code-Switching End-to-End Speech Synthesis.

R Fu, J Tao, Z Wen, J Yi, C Qiang, T Wang - INTERSPEECH, 2020 - isca-archive.org
Most of current end-to-end speech synthesis assumes the input text is in a single language
situation. However, codeswitching in speech occurs frequently in routine life, in which …

[PDF][PDF] A polyglot domain optimised text-to-speech system for railway station announcements.

C Zainkó, M Bartalis, G Németh, G Olaszy - INTERSPEECH, 2015 - researchgate.net
Announcements at railway stations are a major information source for passengers. In order
to ensure high intelligibility, the traditional solution is to use recorded prompts with “slot …

Polyglot speech synthesis: a review

B Sharma, SRM Prasanna - IETE Technical Review, 2017 - Taylor & Francis
The term polyglot speech synthesis refers to the process of producing speech in multiple
languages and single speaker's voice from a single text-to-speech synthesis (TTS) system …