Towards realistic emotional voice conversion using controllable emotional intensity

T Qi, S Wang, C Lu, Y Zhao, Y Zong… - arXiv preprint arXiv …, 2024 - arxiv.org
Realistic emotional voice conversion (EVC) aims to enhance emotional diversity of
converted audios, making the synthesized voices more authentic and natural. To this end …

Enhancing Automatic Speech Recognition for Punjabi Dialects: An Experimental Analysis of Incorporating Prosodic Features and Acoustic Variability Mitigation

V Bhardwaj, T Gera, D Thakur, A Singh - SN Computer Science, 2024 - Springer
The development of Automatic Speech Recognition (ASR) systems has varied
significantly across the roughly 6500 languages that make up the world's spoken languages …

Boosting Cross-Corpus Speech Emotion Recognition using CycleGAN with Contrastive Learning

J Wang, Y Zhao, C Lu, C Tang, S Li, Y Zong… - Proc. Interspeech …, 2024 - isca-archive.org
The premise for the success of most classic speech emotion recognition (SER) algorithms is
that training and testing samples are independent and identically distributed. However, the …

Confidence-aware Hypothesis Transfer Networks for Source-Free Cross-Corpus Speech Emotion Recognition

J Wang, Y Zhao, C Lu, H Lian, H Chang… - Proc. Interspeech …, 2024 - isca-archive.org
The goal of Source-free cross-corpus speech emotion recognition (SER) is to transfer
emotion knowledge from source corpus to target one without access to source data. To …

Comparative Analysis of Voice Conversion in German

K Schäfer, JE Choi, M Steinebach - International Conference on Pattern …, 2025 - Springer
Voice Conversion (VC) has gained attention due to its rapid development and increased
accessibility. However, this also brings a potential threat for misuses. Consequently, it is …