Emotional voice conversion: Theory, databases and esd

K Zhou, B Sisman, R Liu, H Li - Speech Communication, 2022‏ - Elsevier
In this paper, we first provide a review of the state-of-the-art emotional voice conversion
research, and the existing emotional speech databases. We then motivate the development …

[HTML][HTML] KBES: A dataset for realistic Bangla speech emotion recognition with intensity level

MM Billah, ML Sarker, MAH Akhand - Data in Brief, 2023‏ - Elsevier
Abstract Speech Emotion Recognition (SER) identifies and categorizes emotional states by
analyzing speech signals. SER is an emerging research area using machine learning and …

DeepEZ: a graph convolutional network for automated epileptogenic zone localization from resting-state fMRI connectivity

N Nandakumar, D Hsu, R Ahmed… - IEEE Transactions on …, 2022‏ - ieeexplore.ieee.org
Objective: Epileptogenic zone (EZ) localization is a crucial step during diagnostic work up
and therapeutic planning in medication refractory epilepsy. In this paper, we present the first …

[HTML][HTML] Map** discrete emotions in the dimensional space: An acoustic approach

M Trnka, S Darjaa, M Ritomský, R Sabo, M Rusko… - Electronics, 2021‏ - mdpi.com
A frequently used procedure to examine the relationship between categorical and
dimensional descriptions of emotions is to ask subjects to place verbal expressions …

Emotions and virality: Social transmission of political messages on Twitter

N Pivecka, RA Ratzinger, A Florack - Frontiers in psychology, 2022‏ - frontiersin.org
Drawing on previous literature that valence and arousal constitute the fundamental
properties of emotions and that emotional content is a determinant of social transmission …

Non-parallel emotion conversion using a deep-generative hybrid network and an adversarial pair discriminator

R Shankar, J Sager, A Venkataraman - arxiv preprint arxiv:2007.12932, 2020‏ - arxiv.org
We introduce a novel method for emotion conversion in speech that does not require
parallel training data. Our approach loosely relies on a cycle-GAN schema to minimize the …

You're Not You When You're Angry: Robust Emotion Features Emerge by Recognizing Speakers

Z Aldeneh, EM Provost - IEEE Transactions on Affective …, 2021‏ - ieeexplore.ieee.org
The robustness of an acoustic emotion recognition system hinges on first having access to
features that represent an acoustic input signal. These representations should abstract …

[PDF][PDF] A Multi-Speaker Emotion Morphing Model Using Highway Networks and Maximum Likelihood Objective.

R Shankar, J Sager, A Venkataraman - INTERSPEECH, 2019‏ - researchgate.net
We introduce a new model for emotion conversion in speech based on highway neural
networks. Our model uses the contextual pitch, energy and spectral information of a source …

Emotion modelling for speech generation

K Zhou - 2023‏ - search.proquest.com
Speech generation aims to synthesize human-like voices from the input of text or speech.
Current speech generation techniques can generate high quality, natural-sounding speech …

A comparative study of data augmentation techniques for deep learning based emotion recognition

R Shankar, AH Kenfack, A Somayazulu… - arxiv preprint arxiv …, 2022‏ - arxiv.org
Automated emotion recognition in speech is a long-standing problem. While early work on
emotion recognition relied on hand-crafted features and simple classifiers, the field has now …