A survey of deep learning-based multimodal emotion recognition: Speech, text, and face
Multimodal emotion recognition (MER) refers to the identification and understanding of
human emotional states by combining different signals, including—but not limited to—text …
human emotional states by combining different signals, including—but not limited to—text …
Modeling Speech Emotion Recognition via Attention-Oriented Parallel CNN Encoders
Meticulous learning of human emotions through speech is an indispensable function of
modern speech emotion recognition (SER) models. Consequently, deriving and interpreting …
modern speech emotion recognition (SER) models. Consequently, deriving and interpreting …
Exploring corpus-invariant emotional acoustic feature for cross-corpus speech emotion recognition
Unsupervised cross-corpus speech emotion recognition (SER) is the task where the labeled
training (source) and unlabeled testing (target) speech come from different corpora …
training (source) and unlabeled testing (target) speech come from different corpora …
[PDF][PDF] Dual memory fusion for multimodal speech emotion recognition
Deep learning has been widely used in multi-modal Speech Emotion Recognition (SER) to
learn sentiment-related features by aggregating representations from multiple modes …
learn sentiment-related features by aggregating representations from multiple modes …
Sia-net: Sparse interactive attention network for multimodal emotion recognition
S Li, T Zhang, CLP Chen - IEEE Transactions on Computational …, 2024 - ieeexplore.ieee.org
Multimodal emotion recognition (MER) integrates multiple modalities to identify the user's
emotional state, which is the core technology of natural and friendly human–computer …
emotional state, which is the core technology of natural and friendly human–computer …
STIDNet: Identity-aware face forgery detection with spatiotemporal knowledge distillation
The impressive development of facial manipulation techniques has raised severe public
concerns. Identity-aware methods, especially suitable for protecting celebrities, are seen as …
concerns. Identity-aware methods, especially suitable for protecting celebrities, are seen as …
Multimodal emotion recognition in noisy environment based on progressive label revision
The multimodal emotion recognition has attracted more attention in recent decades. Though
remarkable progress has been achieved with the rapid development of deep learning …
remarkable progress has been achieved with the rapid development of deep learning …
Synchronization of fractional-order neural networks with inertia terms via cumulative reduced-order method
L Hu, H Jiang, C Hu, Y Ren, S Chen - Neurocomputing, 2025 - Elsevier
This work investigates the synchronization issue of fractional-order inertial neural networks
(FOINNs) by employing a cumulative reduced-order method. Firstly, a novel cumulative …
(FOINNs) by employing a cumulative reduced-order method. Firstly, a novel cumulative …
Decoupled and Explainable Associative Memory for Effective Knowledge Propagation
Long-term memory often plays a pivotal role in human cognition through the analysis of
contextual information. Machine learning researchers have attempted to emulate this …
contextual information. Machine learning researchers have attempted to emulate this …
AMGCN: An adaptive multi-graph convolutional network for speech emotion recognition
Speech contains rich emotional information, especially in its time and frequency domains.
Therefore, extracting emotional information from these domains to model the global …
Therefore, extracting emotional information from these domains to model the global …