A review on speech emotion recognition using deep learning and attention mechanism

E Lieskovská, M Jakubec, R Jarina, M Chmulík - Electronics, 2021 - mdpi.com
Emotions are an integral part of human interactions and are significant factors in determining
user satisfaction or customer opinion. speech emotion recognition (SER) modules also play …

End-to-end multimodal emotion recognition using deep neural networks

P Tzirakis, G Trigeorgis, MA Nicolaou… - IEEE Journal of …, 2017 - ieeexplore.ieee.org
Automatic affect recognition is a challenging task due to the various modalities emotions can
be expressed with. Applications can be found in many domains including multimedia …

Attention based fully convolutional network for speech emotion recognition

Y Zhang, J Du, Z Wang, J Zhang… - 2018 Asia-Pacific Signal …, 2018 - ieeexplore.ieee.org
Speech emotion recognition is a challenging task for three main reasons: 1) human emotion
is abstract, which means it is hard to distinguish; 2) in general, human emotion can only be …

Big data for industry 4.0: A conceptual framework

MO Gokalp, K Kayabay, MA Akyol… - 2016 international …, 2016 - ieeexplore.ieee.org
Exponential growth in data volume originating from Internet of Things sources and
information services drives the industry to develop new models and distributed tools to …

A multi-task learning framework for emotion recognition using 2D continuous space

R **a, Y Liu - IEEE Transactions on affective computing, 2015 - ieeexplore.ieee.org
Dimensional models have been proposed in psychology studies to represent complex
human emotional expressions. Activation and valence are two common dimensions in such …

Multi-task semi-supervised adversarial autoencoding for speech emotion recognition

S Latif, R Rana, S Khalifa, R Jurdak… - IEEE Transactions …, 2020 - ieeexplore.ieee.org
Inspite the emerging importance of Speech Emotion Recognition (SER), the state-of-the-art
accuracy is quite low and needs improvement to make commercial applications of SER …

Multi-head attention fusion networks for multi-modal speech emotion recognition

J Zhang, L **ng, Z Tan, H Wang, K Wang - Computers & Industrial …, 2022 - Elsevier
Multi-modal speech emotion recognition is a study to predict emotion categories by
combining speech data with other types of data, such as video, speech text transcription …

Multi-channel weight-sharing autoencoder based on cascade multi-head attention for multimodal emotion recognition

J Zheng, S Zhang, Z Wang, X Wang… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Multimodal Emotion Recognition is challenging because of the heterogeneity gap among
different modalities. Due to the powerful ability of feature abstraction, Deep Neural Networks …

Convolutional attention networks for multimodal emotion recognition from speech and text data

WY Choi, KY Song, CW Lee - Proceedings of grand challenge …, 2018 - aclanthology.org
Emotion recognition has become a popular topic of interest, especially in the field of human
computer interaction. Previous works involve unimodal analysis of emotion, while recent …