EMO-SUPERB: An in-depth look at speech emotion recognition

H Wu, HC Chou, KW Chang, L Goncalves, J Du… - arxiv preprint arxiv …, 2024 - arxiv.org
Speech emotion recognition (SER) is a pivotal technology for human-computer interaction
systems. However, 80.77% of SER papers yield results that cannot be reproduced. We …

Open-Emotion: A Reproducible EMO-Superb For Speech Emotion Recognition Systems

H Wu, HC Chou, KW Chang… - 2024 IEEE Spoken …, 2024 - ieeexplore.ieee.org
Speech emotion recognition (SER) is an essential technology for human-computer
interaction systems. However, the previous study reveals that 80.77% of SER papers yield …

Emo-bias: A large scale evaluation of social bias on speech emotion recognition

YC Lin, H Wu, HC Chou, CC Lee, H Lee - arxiv preprint arxiv:2406.05065, 2024 - arxiv.org
The rapid growth of Speech Emotion Recognition (SER) has diverse global applications,
from improving human-computer interactions to aiding mental health diagnostics. However …

Versatile audio-visual learning for emotion recognition

L Goncalves, SG Leem, WC Lin… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Most current audio-visual emotion recognition models lack the flexibility needed for
deployment in practical applications. We envision a multimodal system that works even …

A layer-anchoring strategy for enhancing cross-lingual speech emotion recognition

SG Upadhyay, C Busso, CC Lee - arxiv preprint arxiv:2407.04966, 2024 - arxiv.org
Cross-lingual speech emotion recognition (SER) is important for a wide range of everyday
applications. While recent SER research relies heavily on large pretrained models for …

Learning with rater-expanded label space to improve speech emotion recognition

SG Upadhyay, WS Chien, BH Su… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Automatic sensing of emotional information in speech is important for numerous everyday
applications. Conventional Speech Emotion Recognition (SER) models rely on averaging or …

Balancing speaker-rater fairness for gender-neutral speech emotion recognition

WS Chien, SG Upadhyay… - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
Speech emotion recognition (SER) adds to the humane aspects of voice technologies to
enhance user experiences. The ground truth emotion annotations provided by human raters …

Describe Where You Are: Improving Noise-Robustness for Speech Emotion Recognition with Text Description of the Environment

SG Leem, D Fulford, JP Onnela, D Gard… - arxiv preprint arxiv …, 2024 - arxiv.org
Speech emotion recognition (SER) systems often struggle in real-world environments,
where ambient noise severely degrades their performance. This paper explores a novel …

[PDF][PDF] Bridging emotions across languages: Low rank adaptation for multilingual speech emotion recognition

L Goncalves, D Robinson, E Richerson… - Proc. Interspeech …, 2024 - ecs.utdallas.edu
The field of speech emotion recognition (SER) is constantly evolving with the surge in voice
data and linguistic diversity. This growth highlights the need for SER systems capable of …

Embracing Ambiguity And Subjectivity Using The All-Inclusive Aggregation Rule For Evaluating Multi-Label Speech Emotion Recognition Systems

HC Chou, H Wu, L Goncalves, SG Leem… - 2024 IEEE Spoken …, 2024 - ieeexplore.ieee.org
Speech Emotion Recognition (SER) faces a distinct challenge compared to other speech-
related tasks because the annotations will show the subjective emotional perceptions of …