A systematic review on affective computing: Emotion models, databases, and recent advances

Y Wang, W Song, W Tao, A Liotta, D Yang, X Li, S Gao… - Information …, 2022 - Elsevier
Affective computing conjoins the research topics of emotion recognition and sentiment
analysis, and can be realized with unimodal or multimodal data, consisting primarily of …

Emotion recognition from unimodal to multimodal analysis: A review

K Ezzameli, H Mahersia - Information Fusion, 2023 - Elsevier
The omnipresence of numerous information sources in our daily life brings up new
alternatives for emotion recognition in several domains including e-health, e-learning …

Dawn of the transformer era in speech emotion recognition: closing the valence gap

J Wagner, A Triantafyllopoulos… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Recent advances in transformer-based architectures have shown promise in several
machine learning tasks. In the audio domain, such architectures have been successfully …

Ast: Audio spectrogram transformer

Y Gong, YA Chung, J Glass - arxiv preprint arxiv:2104.01778, 2021 - arxiv.org
In the past decade, convolutional neural networks (CNNs) have been widely adopted as the
main building block for end-to-end audio classification models, which aim to learn a direct …

Federated learning in mobile edge networks: A comprehensive survey

WYB Lim, NC Luong, DT Hoang, Y Jiao… - … surveys & tutorials, 2020 - ieeexplore.ieee.org
In recent years, mobile devices are equipped with increasingly advanced sensing and
computing capabilities. Coupled with advancements in Deep Learning (DL), this opens up …

Speech emotion recognition: Emotional models, databases, features, preprocessing methods, supporting modalities, and classifiers

MB Akçay, K Oğuz - Speech Communication, 2020 - Elsevier
Speech is the most natural way of expressing ourselves as humans. It is only natural then to
extend this communication medium to computer applications. We define speech emotion …

Speech emotion recognition with deep convolutional neural networks

D Issa, MF Demirci, A Yazici - Biomedical Signal Processing and Control, 2020 - Elsevier
The speech emotion recognition (or, classification) is one of the most challenging topics in
data science. In this work, we introduce a new architecture, which extracts mel-frequency …

Speech emotion recognition using deep learning techniques: A review

RA Khalil, E Jones, MI Babar, T Jan, MH Zafar… - IEEE …, 2019 - ieeexplore.ieee.org
Emotion recognition from speech signals is an important but challenging component of
Human-Computer Interaction (HCI). In the literature of speech emotion recognition (SER) …

Multimodal language analysis in the wild: Cmu-mosei dataset and interpretable dynamic fusion graph

AAB Zadeh, PP Liang, S Poria, E Cambria… - Proceedings of the …, 2018 - aclanthology.org
Analyzing human multimodal language is an emerging area of research in NLP. Intrinsically
this language is multimodal (heterogeneous), sequential and asynchronous; it consists of …

Deep learning techniques for speech emotion recognition, from databases to models

BJ Abbaschian, D Sierra-Sosa, A Elmaghraby - Sensors, 2021 - mdpi.com
The advancements in neural networks and the on-demand need for accurate and near real-
time Speech Emotion Recognition (SER) in human–computer interactions make it …