An engineering view on emotions and speech: From analysis and predictive models to responsible human-centered applications

CC Lee, T Chaspari, EM Provost… - Proceedings of the …, 2023 - ieeexplore.ieee.org
The substantial growth of Internet-of-Things technology and the ubiquity of smartphone
devices has increased the public and industry focus on speech emotion recognition (SER) …

Open-Emotion: A Reproducible EMO-Superb For Speech Emotion Recognition Systems

H Wu, HC Chou, KW Chang… - 2024 IEEE Spoken …, 2024 - ieeexplore.ieee.org
Speech emotion recognition (SER) is an essential technology for human-computer
interaction systems. However, the previous study reveals that 80.77% of SER papers yield …

EMO-SUPERB: An in-depth look at speech emotion recognition

H Wu, HC Chou, KW Chang, L Goncalves, J Du… - arxiv preprint arxiv …, 2024 - arxiv.org
Speech emotion recognition (SER) is a pivotal technology for human-computer interaction
systems. However, 80.77% of SER papers yield results that cannot be reproduced. We …

Estimating the uncertainty in emotion attributes using deep evidential regression

W Wu, C Zhang, PC Woodland - arxiv preprint arxiv:2306.06760, 2023 - arxiv.org
In automatic emotion recognition (AER), labels assigned by different human annotators to
the same utterance are often inconsistent due to the inherent complexity of emotion and the …

Exploiting co-occurrence frequency of emotions in perceptual evaluations to train a speech emotion classifier

HC Chou, CC Lee, C Busso - Interspeech 2022, 2022 - par.nsf.gov
Previous studies on speech emotion recognition (SER) with categorical emotions have often
formulated the task as a single-label classification problem, where the emotions are …

[HTML][HTML] Deep temporal clustering features for speech emotion recognition

WC Lin, C Busso - Speech Communication, 2024 - Elsevier
Deep clustering is a popular unsupervised technique for feature representation learning. We
recently proposed the chunk-based DeepEmoCluster framework for speech emotion …

Learning With Rater-Expanded Label Space to Improve Speech Emotion Recognition

SG Upadhyay, WS Chien, BH Su… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Automatic sensing of emotional information in speech is important for numerous everyday
applications. Conventional Speech Emotion Recognition (SER) models rely on averaging or …

Disentangling prosody representations with unsupervised speech reconstruction

L Qu, T Li, C Weber, T Pekarek-Rosin… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org
Human speech can be characterized by different components, including semantic content,
speaker identity and prosodic information. Significant progress has been made in …

Extending speech emotion recognition systems to non-prototypical emotions using mixed-emotion model

P Kumawat, A Routray - Expert Systems with Applications, 2025 - Elsevier
In the conventional approach to speech emotion recognition (SER), the classifier is usually
trained on acted emotional speech data to predict individual basic emotions. In this work, we …

Subjective evaluation of basic emotions from audio–visual data

SR Kadiri, P Alku - Sensors, 2022 - mdpi.com
Understanding of the perception of emotions or affective states in humans is important to
develop emotion-aware systems that work in realistic scenarios. In this paper, the perception …