- Academic Search

Z Chen, Y Zhang, Y Fang, Y Geng, L Guo… - arxiv preprint arxiv …, 2024 - arxiv.org

Knowledge Graphs (KGs) play a pivotal role in advancing various AI applications, with the
semantic web community's exploration into multi-modal dimensions unlocking new avenues …

Save Cite Cited by 44 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] google.com

A review on multimodal zero‐shot learning

W Cao, Y Wu, Y Sun, H Zhang, J Ren… - … : Data Mining and …, 2023 - Wiley Online Library

Multimodal learning provides a path to fully utilize all types of information related to the
modeling target to provide the model with a global vision. Zero‐shot learning (ZSL) is a …

Save Cite Cited by 30 Related articles All 3 versions Free GPT-4

[Free GPT-4]

[PDF] thecvf.com

Open-domain visual entity recognition: Towards recognizing millions of wikipedia entities

H Hu, Y Luan, Y Chen, U Khandelwal… - Proceedings of the …, 2023 - openaccess.thecvf.com

Large-scale multi-modal pre-training models such as CLIP and PaLI exhibit strong
generalization on various visual domains and tasks. However, existing image classification …

Save Cite Cited by 48 Related articles All 5 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] aaai.org

Duet: Cross-modal semantic grounding for contrastive zero-shot learning

Z Chen, Y Huang, J Chen, Y Geng, W Zhang… - Proceedings of the …, 2023 - ojs.aaai.org

Zero-shot learning (ZSL) aims to predict unseen classes whose samples have never
appeared during training. One of the most effective and widely used semantic information for …

Save Cite Cited by 66 Related articles All 6 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Knowledgeable preference alignment for llms in domain-specific question answering

Y Zhang, Z Chen, Y Fang, Y Lu, F Li, W Zhang… - arxiv preprint arxiv …, 2023 - arxiv.org

Deploying large language models (LLMs) to real scenarios for domain-specific question
answering (QA) is a key thrust for LLM applications, which poses numerous challenges …

Save Cite Cited by 24 Related articles All 3 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Meaformer: Multi-modal entity alignment transformer for meta modality hybrid

Z Chen, J Chen, W Zhang, L Guo, Y Fang… - Proceedings of the 31st …, 2023 - dl.acm.org

Multi-modal entity alignment (MMEA) aims to discover identical entities across different
knowledge graphs (KGs) whose entities are associated with relevant images. However …

Save Cite Cited by 57 Related articles All 6 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Crest: Cross-modal resonance through evidential deep learning for enhanced zero-shot learning

H Huang, X Qiao, Z Chen, H Chen, B Li, Z Sun… - Proceedings of the …, 2024 - dl.acm.org

Zero-shot learning (ZSL) enables the recognition of novel classes by leveraging semantic
knowledge transfer from known to unknown categories. This knowledge, typically …

Save Cite Cited by 10 Related articles All 2 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Rethinking uncertainly missing and ambiguous visual modality in multi-modal entity alignment

Z Chen, L Guo, Y Fang, Y Zhang, J Chen… - International Semantic …, 2023 - Springer

As a crucial extension of entity alignment (EA), multi-modal entity alignment (MMEA) aims to
identify identical entities across disparate knowledge graphs (KGs) by exploiting associated …

Save Cite Cited by 31 Related articles All 8 versions Free GPT-4

[Free GPT-4]

[PDF] aclanthology.org

Improving sequential model editing with fact retrieval

X Han, R Li, H Tan, W Yuanlong, Q Chai… - Findings of the …, 2023 - aclanthology.org

The task of sequential model editing is to fix erroneous knowledge in Pre-trained Language
Models (PLMs) efficiently, precisely and continuously. Although existing methods can deal …

Save Cite Cited by 15 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Context disentangling and prototype inheriting for robust visual grounding

W Tang, L Li, X Liu, L **, J Tang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Visual grounding (VG) aims to locate a specific target in an image based on a given
language query. The discriminative information from context is important for distinguishing …

Save Cite Cited by 20 Related articles All 7 versions Free GPT-4

Create alert

Cite

Advanced search

Saved to My library

Zero-shot visual question answering using knowledge graph

Knowledge graphs meet multi-modal learning: A comprehensive survey

A review on multimodal zero‐shot learning

Open-domain visual entity recognition: Towards recognizing millions of wikipedia entities

Duet: Cross-modal semantic grounding for contrastive zero-shot learning

Knowledgeable preference alignment for llms in domain-specific question answering

Meaformer: Multi-modal entity alignment transformer for meta modality hybrid

Crest: Cross-modal resonance through evidential deep learning for enhanced zero-shot learning

Rethinking uncertainly missing and ambiguous visual modality in multi-modal entity alignment

Improving sequential model editing with fact retrieval

Context disentangling and prototype inheriting for robust visual grounding