Information screening whilst exploiting! multimodal relation extraction with feature denoising and multimodal topic modeling

S Wu, H Fei, Y Cao, L Bing, TS Chua - arxiv preprint arxiv:2305.11719, 2023 - arxiv.org
Existing research on multimodal relation extraction (MRE) faces two co-existing challenges,
internal-information over-utilization and external-information under-exploitation. To combat …

Multimodal fusion on low-quality data: A comprehensive survey

Q Zhang, Y Wei, Z Han, H Fu, X Peng, C Deng… - arxiv preprint arxiv …, 2024 - arxiv.org
Multimodal fusion focuses on integrating information from multiple modalities with the goal of
more accurate prediction, which has achieved remarkable progress in a wide range of …

Enhancing multimodal entity and relation extraction with variational information bottleneck

S Cui, J Cao, X Cong, J Sheng, Q Li… - … /ACM Transactions on …, 2024 - ieeexplore.ieee.org
This article studies the multimodal named entity recognition (MNER) and multimodal relation
extraction (MRE), which are important for content analysis and various applications. The …

M2DF: Multi-grained Multi-curriculum Denoising Framework for Multimodal Aspect-based Sentiment Analysis

F Zhao, C Li, Z Wu, Y Ouyang, J Zhang… - arxiv preprint arxiv …, 2023 - arxiv.org
Multimodal Aspect-based Sentiment Analysis (MABSA) is a fine-grained Sentiment Analysis
task, which has attracted growing research interests recently. Existing work mainly utilizes …

Owner name entity recognition in websites based on multiscale features and multimodal co-attention

Y Ren, H Li, P Liu, J Liu, H Zhu, L Sun - Expert Systems with Applications, 2023 - Elsevier
Identifying the owners of online devices on the Internet can enable numerous network
security applications. For example, fast and accurate Owner Name Entity Recognition …

MM-BigBench: Evaluating Multimodal Models on Multimodal Content Comprehension Tasks

X Yang, W Wu, S Feng, M Wang, D Wang, Y Li… - arxiv preprint arxiv …, 2023 - arxiv.org
The popularity of multimodal large language models (MLLMs) has triggered a recent surge
in research efforts dedicated to evaluating these models. Nevertheless, existing evaluation …

I2SRM: Intra-and Inter-Sample Relationship Modeling for Multimodal Information Extraction

Y Huang, Z Lin - Proceedings of the 5th ACM International Conference …, 2023 - dl.acm.org
Multimodal information extraction is attracting research attention nowadays, which requires
aggregating representations from different modalities. In this paper, we present the Intra-and …

MORE: A Multimodal Object-Entity Relation Extraction Dataset with a Benchmark Evaluation

L He, H Wang, Y Cao, Z Wu, J Zhang… - Proceedings of the 31st …, 2023 - dl.acm.org
Extracting relational facts from multimodal data is a crucial task in the field of multimedia and
knowledge graphs that feeds into widespread real-world applications. The emphasis of …

TCMT: Target-oriented Cross Modal Transformer for Multimodal Aspect-Based Sentiment Analysis

W Zou, X Sun, W Wu, Q Lu, X Zhao, Q Bo… - Expert Systems with …, 2025 - Elsevier
Abstract Multimodal Aspect-Based Sentiment Analysis (MABSA) technology aims to utilize
both textual and visual modalities to achieve Multimodal Aspect Term Extraction (MATE) and …

A unified visual prompt tuning framework with mixture-of-experts for multimodal information extraction

B Xu, S Huang, M Du, H Wang, H Song, Y **ao… - … on Database Systems …, 2023 - Springer
Recently, multimodal information extraction has gained increasing attention in social media
understanding, as it helps to accomplish the task of information extraction by adding images …