Knowledge graphs meet multi-modal learning: A comprehensive survey

Z Chen, Y Zhang, Y Fang, Y Geng, L Guo… - arxiv preprint arxiv …, 2024 - arxiv.org
Knowledge Graphs (KGs) play a pivotal role in advancing various AI applications, with the
semantic web community's exploration into multi-modal dimensions unlocking new avenues …

A review on multimodal zero‐shot learning

W Cao, Y Wu, Y Sun, H Zhang, J Ren… - … : Data Mining and …, 2023 - Wiley Online Library
Multimodal learning provides a path to fully utilize all types of information related to the
modeling target to provide the model with a global vision. Zero‐shot learning (ZSL) is a …

Open-domain visual entity recognition: Towards recognizing millions of wikipedia entities

H Hu, Y Luan, Y Chen, U Khandelwal… - Proceedings of the …, 2023 - openaccess.thecvf.com
Large-scale multi-modal pre-training models such as CLIP and PaLI exhibit strong
generalization on various visual domains and tasks. However, existing image classification …

Duet: Cross-modal semantic grounding for contrastive zero-shot learning

Z Chen, Y Huang, J Chen, Y Geng, W Zhang… - Proceedings of the …, 2023 - ojs.aaai.org
Zero-shot learning (ZSL) aims to predict unseen classes whose samples have never
appeared during training. One of the most effective and widely used semantic information for …

Knowledgeable preference alignment for llms in domain-specific question answering

Y Zhang, Z Chen, Y Fang, Y Lu, F Li, W Zhang… - arxiv preprint arxiv …, 2023 - arxiv.org
Deploying large language models (LLMs) to real scenarios for domain-specific question
answering (QA) is a key thrust for LLM applications, which poses numerous challenges …

Meaformer: Multi-modal entity alignment transformer for meta modality hybrid

Z Chen, J Chen, W Zhang, L Guo, Y Fang… - Proceedings of the 31st …, 2023 - dl.acm.org
Multi-modal entity alignment (MMEA) aims to discover identical entities across different
knowledge graphs (KGs) whose entities are associated with relevant images. However …

Crest: Cross-modal resonance through evidential deep learning for enhanced zero-shot learning

H Huang, X Qiao, Z Chen, H Chen, B Li, Z Sun… - Proceedings of the …, 2024 - dl.acm.org
Zero-shot learning (ZSL) enables the recognition of novel classes by leveraging semantic
knowledge transfer from known to unknown categories. This knowledge, typically …

Rethinking uncertainly missing and ambiguous visual modality in multi-modal entity alignment

Z Chen, L Guo, Y Fang, Y Zhang, J Chen… - International Semantic …, 2023 - Springer
As a crucial extension of entity alignment (EA), multi-modal entity alignment (MMEA) aims to
identify identical entities across disparate knowledge graphs (KGs) by exploiting associated …

Improving sequential model editing with fact retrieval

X Han, R Li, H Tan, W Yuanlong, Q Chai… - Findings of the …, 2023 - aclanthology.org
The task of sequential model editing is to fix erroneous knowledge in Pre-trained Language
Models (PLMs) efficiently, precisely and continuously. Although existing methods can deal …

Context disentangling and prototype inheriting for robust visual grounding

W Tang, L Li, X Liu, L **, J Tang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Visual grounding (VG) aims to locate a specific target in an image based on a given
language query. The discriminative information from context is important for distinguishing …