Towards realistic emotional voice conversion using controllable emotional intensity
Realistic emotional voice conversion (EVC) aims to enhance emotional diversity of
converted audios, making the synthesized voices more authentic and natural. To this end …
converted audios, making the synthesized voices more authentic and natural. To this end …
Enhancing Automatic Speech Recognition for Punjabi Dialects: An Experimental Analysis of Incorporating Prosodic Features and Acoustic Variability Mitigation
Abstract The development of Automatic Speech Recognition (ASR) systems has varied
significantly across the roughly 6500 languages that make up the world's spoken languages …
significantly across the roughly 6500 languages that make up the world's spoken languages …
[PDF][PDF] Boosting Cross-Corpus Speech Emotion Recognition using CycleGAN with Contrastive Learning
The premise for the success of most classic speech emotion recognition (SER) algorithms is
that training and testing samples are independent and identically distributed. However, the …
that training and testing samples are independent and identically distributed. However, the …
[PDF][PDF] Confidence-aware Hypothesis Transfer Networks for Source-Free Cross-Corpus Speech Emotion Recognition
The goal of Source-free cross-corpus speech emotion recognition (SER) is to transfer
emotion knowledge from source corpus to target one without access to source data. To …
emotion knowledge from source corpus to target one without access to source data. To …
Comparative Analysis of Voice Conversion in German
K Schäfer, JE Choi, M Steinebach - International Conference on Pattern …, 2025 - Springer
Voice Conversion (VC) has gained attention due to its rapid development and increased
accessibility. However, this also brings a potential threat for misuses. Consequently, it is …
accessibility. However, this also brings a potential threat for misuses. Consequently, it is …