A survey of deep learning-based multimodal emotion recognition: Speech, text, and face

H Lian, C Lu, S Li, Y Zhao, C Tang, Y Zong - Entropy, 2023 - mdpi.com
Multimodal emotion recognition (MER) refers to the identification and understanding of
human emotional states by combining different signals, including—but not limited to—text …

Multimodal emotion recognition: A comprehensive review, trends, and challenges

MPA Ramaswamy… - … Reviews: Data Mining and …, 2024 - Wiley Online Library
Automatic emotion recognition is a burgeoning field of research and has its roots in
psychology and cognitive science. This article comprehensively reviews multimodal emotion …

[HTML][HTML] Enhancing Human Activity Recognition through Integrated Multimodal Analysis: A Focus on RGB Imaging, Skeletal Tracking, and Pose Estimation

SU Rehman, AU Yasin, E Ul Haq, M Ali, J Kim… - Sensors, 2024 - mdpi.com
Human activity recognition (HAR) is pivotal in advancing applications ranging from
healthcare monitoring to interactive gaming. Traditional HAR systems, primarily relying on …

STP-MFM: semi-tensor product-based multi-modal factorized multilinear pooling for information fusion in sentiment analysis

F Liu, J Chen, K Li, J Bai, W Tan, C Cai… - Digital Signal Processing, 2024 - Elsevier
Multi-modal fusion can exploit complementary information from various modalities and
improve the accuracy of prediction or classification tasks. In this paper, we propose a semi …

Using Large Language Models for education managements in Vietnamese with low resources

DD Minh, VN Van, TD Cong - arxiv preprint arxiv:2501.15022, 2025 - arxiv.org
Large language models (LLMs), such as GPT-4, Gemini 1.5, Claude 3.5 Sonnet, and
Llama3, have demonstrated significant advancements in various NLP tasks since the …

[HTML][HTML] Advances in Uncertain Information Fusion

L Jiao - Entropy, 2024 - mdpi.com
Information fusion is the combination of information from multiple sources, which aims to
draw more comprehensive, specific, and accurate inferences about the world than are …

Granularity Based Inter and Intra-Modal Fusion Network for Sarcasm Detection

Y Shi, X Zhao, M Chen - 2023 - researchsquare.com
Multi-modal sarcasm detection is a task that involves detecting and identifyingsarcasm using
multiple modalities of information. The key aspect of this task lies in how to model intra and …

Cross-dimensional Principal Component Analysis-A New Dimensionality Reduction Method

N Fan, L Zhang, R Liu - 2024 China Automation Congress …, 2024 - ieeexplore.ieee.org
cross-dimensional principal component analysis (CD-PCA). It is based on the semi-tensor
product of matrices theory (STP), where a new projection rule is introduced to reduce …