Advanced Deep Learning Models For Emotion Detection In Speech: Applying The Ravdess Dataset

GDP Aryono, D Ferawati… - Jurasik (Jurnal Riset …, 2024 - ejurnal.tunasbangsa.ac.id
This study introduces a comprehensive approach to emotion recognition in speech using the
Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS). The method …

Comparative analysis of multi-loss functions for enhanced multi-modal speech emotion recognition

PN Tran, TDT Vu, NT Pham… - … on Information and …, 2023 - ieeexplore.ieee.org
In recent years, multi-modal analysis has gained significant prominence across domains
such as audio/speech processing, natural language processing, and affective computing …

Mersa: Multimodal emotion recognition with self-align embedding

QB Le, KT Trinh, NDH Son, PN Tran… - 2024 International …, 2024 - ieeexplore.ieee.org
Emotions are an integral part of human communication and interaction, significantly sha**
our social connections, decision-making, and overall well-being. Understanding and …

SER-Fuse: An Emotion Recognition Application Utilizing Multi-Modal, Multi-Lingual, and Multi-Feature Fusion

NT Pham, LT Phan, DNM Dang… - Proceedings of the 12th …, 2023 - dl.acm.org
Speech emotion recognition (SER) is a crucial aspect of affective computing and human-
computer interaction, yet effectively identifying emotions in different speakers and languages …

Enhancing Speech Emotion Recognition Through Knowledge Distillation

TM Nguyen, PN Tran, DNM Dang - 2024 15th International …, 2024 - ieeexplore.ieee.org
The importance of Speech Emotion Recognition (SER) is growing across diverse
applications, which has resulted in the development of multiple methodologies and models …