Qi-tts: Questioning intonation control for emotional speech synthesis

H Tang, X Zhang, J Wang, N Cheng… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Recent expressive text to speech (TTS) models focus on synthesizing emotional speech, but
some fine-grained styles such as intonation are neglected. In this paper, we propose QI-TTS …

[PDF][PDF] Exploiting Emotion Information in Speaker Embeddings for Expressive Text-to-Speech

Z Shaheen, T Sadekova, Y Matveeva… - INTERSPEECH …, 2023 - isca-archive.org
Abstract Text-to-Speech (TTS) systems have recently seen great progress in synthesizing
high-quality speech. However, the prosody of generated utterances often is not as diverse …

PRESENT: Zero-Shot Text-to-Prosody Control

P Lam, H Zhang, NF Chen, B Sisman… - IEEE Signal …, 2025 - ieeexplore.ieee.org
Current strategies for achieving fine-grained prosody control in speech synthesis entail
extracting additional style embeddings or adopting more complex architectures. To enable …

Continuous Sign Language Recognition and Its Translation into Intonation-Colored Speech

N Amangeldy, A Ukenova, G Bekmanova… - Sensors, 2023 - mdpi.com
This article is devoted to solving the problem of converting sign language into a consistent
text with intonation markup for subsequent voice synthesis of sign phrases by speech with …