A survey of information extraction based on deep learning

Y Yang, Z Wu, Y Yang, S Lian, F Guo, Z Wang - Applied Sciences, 2022 - mdpi.com
As a core task and an important link in the fields of natural language understanding and
information retrieval, information extraction (IE) can structure and semanticize unstructured …

Knowledge graphs meet multi-modal learning: A comprehensive survey

Z Chen, Y Zhang, Y Fang, Y Geng, L Guo… - arxiv preprint arxiv …, 2024 - arxiv.org
Knowledge Graphs (KGs) play a pivotal role in advancing various AI applications, with the
semantic web community's exploration into multi-modal dimensions unlocking new avenues …

Multi-modal knowledge graph construction and application: A survey

X Zhu, Z Li, X Wang, X Jiang, P Sun… - … on Knowledge and …, 2022 - ieeexplore.ieee.org
Recent years have witnessed the resurgence of knowledge engineering which is featured
by the fast growth of knowledge graphs. However, most of existing knowledge graphs are …

Multimodal aspect-based sentiment analysis: a survey of tasks, methods, challenges and future directions

T Zhao, L Meng, D Song - Information Fusion, 2024 - Elsevier
With the development of social media, users increasingly tend to express their sentiments
(broadly including sentiment polarities, emotions and sarcasm, etc.) associated with fine …

Chain-of-thought prompt distillation for multimodal named entity and multimodal relation extraction

F Chen, Y Feng - arxiv preprint arxiv:2306.14122, 2023 - arxiv.org
Multimodal Named Entity Recognition (MNER) and Multimodal Relation Extraction (MRE)
necessitate the fundamental reasoning capacity for intricate linguistic and multimodal …

Umie: Unified multimodal information extraction with instruction tuning

L Sun, K Zhang, Q Li, R Lou - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Multimodal information extraction (MIE) gains significant attention as the popularity of
multimedia content increases. However, current MIE methods often resort to using task …

Multi-granularity cross-modal representation learning for named entity recognition on social media

P Liu, G Wang, H Li, J Liu, Y Ren, H Zhu… - Information Processing & …, 2024 - Elsevier
With social media posts tending to be multimodal, Multimodal Named Entity Recognition
(MNER) for the text with its accompanying image is attracting more and more attention since …

Learning implicit entity-object relations by bidirectional generative alignment for multimodal ner

F Chen, J Liu, K Ji, W Ren, J Wang… - Proceedings of the 31st …, 2023 - dl.acm.org
The challenge posed by multimodal named entity recognition (MNER) is mainly two-fold:(1)
bridging the semantic gap between text and image and (2) matching the entity with its …

ICKA: an instruction construction and knowledge alignment framework for multimodal named entity recognition

Q Zeng, M Yuan, J Wan, K Wang, N Shi, Q Che… - Expert Systems with …, 2024 - Elsevier
Abstract Multimodal Named Entity Recognition (MNER) aims to identify entities of predefined
types in text by leveraging information from multiple modalities, most notably textual and …

Prompting chatgpt in MNER: enhanced multimodal named entity recognition with auxiliary refined knowledge

J Li, H Li, Z Pan, D Sun, J Wang, W Zhang… - arxiv preprint arxiv …, 2023 - arxiv.org
Multimodal Named Entity Recognition (MNER) on social media aims to enhance textual
entity prediction by incorporating image-based clues. Existing studies mainly focus on …