Adversarial alignment and graph fusion via information bottleneck for multimodal emotion recognition in conversations
With the rapid development of social media and human–computer interaction, multimodal
emotion recognition in conversations (MERC) tasks have begun to receive widespread …
emotion recognition in conversations (MERC) tasks have begun to receive widespread …
Adversarial representation with intra-modal and inter-modal graph contrastive learning for multimodal emotion recognition
With the release of increasing open-source emotion recognition datasets on social media
platforms and the rapid development of computing resources, multimodal emotion …
platforms and the rapid development of computing resources, multimodal emotion …
Variational causal inference network for explanatory visual question answering
Abstract Explanatory Visual Question Answering (EVQA) is a recently proposed multimodal
reasoning task that requires answering visual questions and generating multimodal …
reasoning task that requires answering visual questions and generating multimodal …
LDRE: LLM-based Divergent Reasoning and Ensemble for Zero-Shot Composed Image Retrieval
Zero-Shot Composed Image Retrieval (ZS-CIR) has garnered increasing interest in recent
years, which aims to retrieve a target image based on a query composed of a reference …
years, which aims to retrieve a target image based on a query composed of a reference …
Multi-level contrastive learning: Hierarchical alleviation of heterogeneity in multimodal sentiment analysis
Recently, multimodal fusion efforts have achieved remarkable success in Multimodal
Sentiment Analysis (MSA). However, most of the existing methods are based on model-level …
Sentiment Analysis (MSA). However, most of the existing methods are based on model-level …
A survey on cross-media search based on user intention understanding in social networks
With the increasing popularity of online social networks, more and more people are posting
information, updating their statuses, and searching for topics there. Massive cross-media big …
information, updating their statuses, and searching for topics there. Massive cross-media big …
Adversarial Graph Neural Network for Multivariate Time Series Anomaly Detection
Anomaly detection is one of the most significant tasks in multivariate time series analysis,
while it remains challenging to model complex patterns for improving detection accuracy …
while it remains challenging to model complex patterns for improving detection accuracy …
Open-world social event classification
With the rapid development of Internet and the expanding scale of social media, social event
classification has attracted increasing attention. The key to social event classification is …
classification has attracted increasing attention. The key to social event classification is …
EduCross: Dual adversarial bipartite hypergraph learning for cross-modal retrieval in multimodal educational slides
In the digital education landscape, cross-modal retrieval (CMR) from multimodal educational
slides represents a significant challenge, particularly because of the complex nature of …
slides represents a significant challenge, particularly because of the complex nature of …
Fine-grained Prototypical Voting with Heterogeneous Mixup for Semi-supervised 2D-3D Cross-modal Retrieval
This paper studies the problem of semi-supervised 2D-3D retrieval which aims to align both
labeled and unlabeled 2D and 3D data into the same embedding space. The problem is …
labeled and unlabeled 2D and 3D data into the same embedding space. The problem is …