Promptvc: Flexible stylistic voice conversion in latent space driven by natural language prompts

J Yao, Y Yang, Y Lei, Z Ning, Y Hu… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
Stylistic voice conversion aims to transform the style of source speech to a desired style
according to real-world application demands. However, the current style voice conversion …

[HTML][HTML] Scalability and diversity of StarGANv2-VC in Arabic emotional voice conversion: Overcoming data limitations and enhancing performance

AH Meftah, YA Alotaibi, SA Selouani - Journal of King Saud University …, 2024 - Elsevier
Abstract Emotional Voice Conversion (EVC) for under-resourced languages like Arabic
faces challenges due to limited emotional speech data. This study explored strategies to …

Emotion modelling for speech generation

K Zhou - 2023 - search.proquest.com
Speech generation aims to synthesize human-like voices from the input of text or speech.
Current speech generation techniques can generate high quality, natural-sounding speech …

AffectEcho: Speaker Independent and Language-Agnostic Emotion and Affect Transfer for Speech Synthesis

H Viswanath, A Bhattacharya, P Jutras-Dubé… - arxiv preprint arxiv …, 2023 - arxiv.org
Affect is an emotional characteristic encompassing valence, arousal, and intensity, and is a
crucial attribute for enabling authentic conversations. While existing text-to-speech (TTS) …