A survey of deep learning-based multimodal emotion recognition: Speech, text, and face

H Lian, C Lu, S Li, Y Zhao, C Tang, Y Zong - Entropy, 2023 - mdpi.com
Multimodal emotion recognition (MER) refers to the identification and understanding of
human emotional states by combining different signals, including—but not limited to—text …

Modeling Speech Emotion Recognition via Attention-Oriented Parallel CNN Encoders

F Makhmudov, A Kutlimuratov, F Akhmedov… - Electronics, 2022 - mdpi.com
Meticulous learning of human emotions through speech is an indispensable function of
modern speech emotion recognition (SER) models. Consequently, deriving and interpreting …

Exploring corpus-invariant emotional acoustic feature for cross-corpus speech emotion recognition

H Lian, C Lu, Y Zhao, S Li, T Qi, Y Zong - Expert Systems with Applications, 2024 - Elsevier
Unsupervised cross-corpus speech emotion recognition (SER) is the task where the labeled
training (source) and unlabeled testing (target) speech come from different corpora …

[PDF][PDF] Dual memory fusion for multimodal speech emotion recognition

D Priyasad, T Fernando, S Sridharan… - Proc …, 2023 - isca-archive.org
Deep learning has been widely used in multi-modal Speech Emotion Recognition (SER) to
learn sentiment-related features by aggregating representations from multiple modes …

Sia-net: Sparse interactive attention network for multimodal emotion recognition

S Li, T Zhang, CLP Chen - IEEE Transactions on Computational …, 2024 - ieeexplore.ieee.org
Multimodal emotion recognition (MER) integrates multiple modalities to identify the user's
emotional state, which is the core technology of natural and friendly human–computer …

STIDNet: Identity-aware face forgery detection with spatiotemporal knowledge distillation

M Fang, L Yu, H **e, Q Tan, Z Tan… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
The impressive development of facial manipulation techniques has raised severe public
concerns. Identity-aware methods, especially suitable for protecting celebrities, are seen as …

Multimodal emotion recognition in noisy environment based on progressive label revision

S Li, H Lian, C Lu, Y Zhao, C Tang, Y Zong… - Proceedings of the 31st …, 2023 - dl.acm.org
The multimodal emotion recognition has attracted more attention in recent decades. Though
remarkable progress has been achieved with the rapid development of deep learning …

Synchronization of fractional-order neural networks with inertia terms via cumulative reduced-order method

L Hu, H Jiang, C Hu, Y Ren, S Chen - Neurocomputing, 2025 - Elsevier
This work investigates the synchronization issue of fractional-order inertial neural networks
(FOINNs) by employing a cumulative reduced-order method. Firstly, a novel cumulative …

Decoupled and Explainable Associative Memory for Effective Knowledge Propagation

T Fernando, D Priyasad, S Sridharan… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Long-term memory often plays a pivotal role in human cognition through the analysis of
contextual information. Machine learning researchers have attempted to emulate this …

AMGCN: An adaptive multi-graph convolutional network for speech emotion recognition

H Lian, C Lu, H Chang, Y Zhao, S Li, Y Li… - Speech Communication, 2025 - Elsevier
Speech contains rich emotional information, especially in its time and frequency domains.
Therefore, extracting emotional information from these domains to model the global …