A low-rank matching attention based cross-modal feature fusion method for conversational emotion recognition

Y Shou, H Liu, X Cao, D Meng… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Conversational emotion recognition (CER) is an important research topic in human-
computer interactions. Although recent advancements in transformer-based cross-modal …

Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis

L Sun, Z Lian, B Liu, J Tao - IEEE Transactions on Affective …, 2023 - ieeexplore.ieee.org
With the proliferation of user-generated online videos, Multimodal Sentiment Analysis (MSA)
has attracted increasing attention recently. Despite significant progress, there are still two …

Smin: Semi-supervised multi-modal interaction network for conversational emotion recognition

Z Lian, B Liu, J Tao - IEEE Transactions on Affective Computing, 2022 - ieeexplore.ieee.org
Conversational emotion recognition is a crucial research topic in human-computer
interactions. Due to the heavy annotation cost and inevitable label ambiguity, collecting …

SKEAFN: sentiment knowledge enhanced attention fusion network for multimodal sentiment analysis

C Zhu, M Chen, S Zhang, C Sun, H Liang, Y Liu… - Information …, 2023 - Elsevier
Multimodal sentiment analysis is an active research field that aims to recognize the user's
sentiment information from multimodal data. The primary challenge in this field is to develop …

Fc-kan: Function combinations in kolmogorov-arnold networks

HT Ta, DQ Thai, ABS Rahman, G Sidorov… - arxiv preprint arxiv …, 2024 - arxiv.org
In this paper, we introduce FC-KAN, a Kolmogorov-Arnold Network (KAN) that leverages
combinations of popular mathematical functions such as B-splines, wavelets, and radial …

Attention gated tensor neural network architectures for speech emotion recognition

SK Pandey, HS Shekhawat, SRM Prasanna - … Signal Processing and …, 2022 - Elsevier
In an attempt to make Human-Computer Interactions more natural, we propose the use of
Tensor Factorized Neural Networks (TFNN) and Attention Gated Tensor Factorized Neural …

Multi-granularity relational attention network for audio-visual question answering

L Li, T **, W Lin, H Jiang, W Pan… - … on Circuits and …, 2023 - ieeexplore.ieee.org
Recent methods for video question answering (VideoQA), aiming to generate answers
based on given questions and video content, have made significant progress in cross-modal …

Low-rank Prompt Interaction for Continual Vision-Language Retrieval

W Yan, Y Wang, W Lin, Z Guo, Z Zhao… - Proceedings of the 32nd …, 2024 - dl.acm.org
Research on continual learning in multi-modal tasks has been receiving increasing
attention. However, most existing work overlooks the explicit cross-modal and cross-task …

Pirnet: Personality-enhanced iterative refinement network for emotion recognition in conversation

Z Lian, B Liu, J Tao - IEEE Transactions on Neural Networks …, 2022 - ieeexplore.ieee.org
Emotion recognition in conversation (ERC) is important for enhancing user experience in
human–computer interaction. Unlike vanilla emotion recognition in individual utterances …

Real-time multimodal interaction in virtual reality-a case study with a large virtual interface

L Cao, H Zhang, C Peng, JT Hansberger - Multimedia Tools and …, 2023 - Springer
The values of VR and multimodal interaction technologies offer creative, virtual alternatives
to manipulate a large data set in a virtual environment. This work presents the design …