Knowledge graphs meet multi-modal learning: A comprehensive survey
Knowledge Graphs (KGs) play a pivotal role in advancing various AI applications, with the
semantic web community's exploration into multi-modal dimensions unlocking new avenues …
semantic web community's exploration into multi-modal dimensions unlocking new avenues …
A review on multimodal zero‐shot learning
Multimodal learning provides a path to fully utilize all types of information related to the
modeling target to provide the model with a global vision. Zero‐shot learning (ZSL) is a …
modeling target to provide the model with a global vision. Zero‐shot learning (ZSL) is a …
Open-domain visual entity recognition: Towards recognizing millions of wikipedia entities
Large-scale multi-modal pre-training models such as CLIP and PaLI exhibit strong
generalization on various visual domains and tasks. However, existing image classification …
generalization on various visual domains and tasks. However, existing image classification …
Duet: Cross-modal semantic grounding for contrastive zero-shot learning
Zero-shot learning (ZSL) aims to predict unseen classes whose samples have never
appeared during training. One of the most effective and widely used semantic information for …
appeared during training. One of the most effective and widely used semantic information for …
Knowledgeable preference alignment for llms in domain-specific question answering
Deploying large language models (LLMs) to real scenarios for domain-specific question
answering (QA) is a key thrust for LLM applications, which poses numerous challenges …
answering (QA) is a key thrust for LLM applications, which poses numerous challenges …
Meaformer: Multi-modal entity alignment transformer for meta modality hybrid
Multi-modal entity alignment (MMEA) aims to discover identical entities across different
knowledge graphs (KGs) whose entities are associated with relevant images. However …
knowledge graphs (KGs) whose entities are associated with relevant images. However …
Crest: Cross-modal resonance through evidential deep learning for enhanced zero-shot learning
Zero-shot learning (ZSL) enables the recognition of novel classes by leveraging semantic
knowledge transfer from known to unknown categories. This knowledge, typically …
knowledge transfer from known to unknown categories. This knowledge, typically …
Rethinking uncertainly missing and ambiguous visual modality in multi-modal entity alignment
As a crucial extension of entity alignment (EA), multi-modal entity alignment (MMEA) aims to
identify identical entities across disparate knowledge graphs (KGs) by exploiting associated …
identify identical entities across disparate knowledge graphs (KGs) by exploiting associated …
Improving sequential model editing with fact retrieval
X Han, R Li, H Tan, W Yuanlong, Q Chai… - Findings of the …, 2023 - aclanthology.org
The task of sequential model editing is to fix erroneous knowledge in Pre-trained Language
Models (PLMs) efficiently, precisely and continuously. Although existing methods can deal …
Models (PLMs) efficiently, precisely and continuously. Although existing methods can deal …
Context disentangling and prototype inheriting for robust visual grounding
Visual grounding (VG) aims to locate a specific target in an image based on a given
language query. The discriminative information from context is important for distinguishing …
language query. The discriminative information from context is important for distinguishing …