Describe Where You Are: Improving Noise-Robustness for Speech Emotion Recognition with Text Description of the Environment

SG Leem, D Fulford, JP Onnela, D Gard… - arxiv preprint arxiv …, 2024 - arxiv.org
Speech emotion recognition (SER) systems often struggle in real-world environments,
where ambient noise severely degrades their performance. This paper explores a novel …

A Reliable speech emotion recognition framework for multi-regional languages using optimized light gradient boosting machine classifier

S Radhika, A Prasanth, KKD Sowndarya - Biomedical Signal Processing …, 2025 - Elsevier
In today's world, the interpretation of emotions from human speech has prompted a lot of
research attention in signal processing applications. Many speech recognition approaches …

Speech Emotion Recognition under Noisy Environments with SNR Down to− 6 dB Using Multi-Decoder Wave-U-Net

HJ Nam, HJ Park - Applied Sciences, 2024 - mdpi.com
A speech emotion recognition (SER) model for noisy environments is proposed, by using
four band-pass filtered speech waveforms as the model input instead of the simplified input …

[PDF][PDF] Keep, Delete, or Substitute: Frame Selection Strategy for Noise-Robust Speech Emotion Recognition

SG Leem, D Fulford, JP Onnela, D Gard… - Proc. Interspeech …, 2024 - isca-archive.org
Speech emotion recognition (SER) system can exploit an Speech enhancement (SE) model
to increase its noise robustness by suppressing the background noise. However, SE could …

Enhancing Emotion Recognition in Conversation Through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning

H Shi, X Zhang, N Cheng, Y Zhang, J Yu, J **ao… - … on Intelligent Computing, 2024 - Springer
The purpose of emotion recognition in conversation (ERC) is to identify the emotion category
of an utterance based on contextual information. Previous ERC methods relied on simple …

[PDF][PDF] Reinforcement Learning based Data Augmentation for Noise Robust Speech Emotion Recognition

S Ranjan, R Chakraborty, SK Kopparapu - Proc. Interspeech 2024, 2024 - isca-archive.org
Speech emotion recognition (SER) is an indispensable component of any human machine
interactions, and enables building empathetic voice user interfaces. Ability to accurately …

HuBERT-CLAP: Contrastive Learning-Based Multimodal Emotion Recognition using Self-Alignment Approach

LH Nguyen, NT Pham, M Khan, A Othmani… - Proceedings of the 6th …, 2024 - dl.acm.org
A breakthrough in deep learning has led to improvements in speech emotion recognition
(SER), but these studies tend to process fixed-length segments, resulting in degraded …

Joint enhancement and classification constraints for noisy speech emotion recognition

L Sun, Y Lei, S Wang, S Chen, M Zhao, P Li - Digital Signal Processing, 2024 - Elsevier
In the natural environment, the received speech signal is often interfered by noise, which
reduces the performance of speech emotion recognition (SER) system. To this end, a noisy …

Emotion Sense:-Real-time Speech Emotion Recognition for live calls

S Buddha, R Sawant, SS Ingle… - … on Advances in …, 2024 - ieeexplore.ieee.org
Emotion Sense for live call is a technology that is designed to detect emotions of the user
like happiness, sadness, anger, fear, etc. It is a Speech Emotion Recognition (SER) model …

Personified Emotion Detection from Speech using Supervised Machine Learning

S Hota, P Chaudhury, S Kalia… - … in Technology (IC …, 2024 - ieeexplore.ieee.org
Speech-based emotion recognition in machine learning is an emerging topic. In our daily
routine, whatever speech we communicate can be based on our mood or emotion that we …