Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Knowledge graphs meet multi-modal learning: A comprehensive survey
Knowledge Graphs (KGs) play a pivotal role in advancing various AI applications, with the
semantic web community's exploration into multi-modal dimensions unlocking new avenues …
semantic web community's exploration into multi-modal dimensions unlocking new avenues …
Unifying large language models and knowledge graphs: A roadmap
Large language models (LLMs), such as ChatGPT and GPT4, are making new waves in the
field of natural language processing and artificial intelligence, due to their emergent ability …
field of natural language processing and artificial intelligence, due to their emergent ability …
Large language models are visual reasoning coordinators
L Chen, B Li, S Shen, J Yang, C Li… - Advances in …, 2023 - proceedings.neurips.cc
Visual reasoning requires multimodal perception and commonsense cognition of the world.
Recently, multiple vision-language models (VLMs) have been proposed with excellent …
Recently, multiple vision-language models (VLMs) have been proposed with excellent …
Fine-grained late-interaction multi-modal retrieval for retrieval augmented visual question answering
W Lin, J Chen, J Mei, A Coca… - Advances in Neural …, 2023 - proceedings.neurips.cc
Abstract Knowledge-based Visual Question Answering (KB-VQA) requires VQA systems to
utilize knowledge from external knowledge bases to answer visually-grounded questions …
utilize knowledge from external knowledge bases to answer visually-grounded questions …
Meaformer: Multi-modal entity alignment transformer for meta modality hybrid
Multi-modal entity alignment (MMEA) aims to discover identical entities across different
knowledge graphs (KGs) whose entities are associated with relevant images. However …
knowledge graphs (KGs) whose entities are associated with relevant images. However …
Rethinking uncertainly missing and ambiguous visual modality in multi-modal entity alignment
As a crucial extension of entity alignment (EA), multi-modal entity alignment (MMEA) aims to
identify identical entities across disparate knowledge graphs (KGs) by exploiting associated …
identify identical entities across disparate knowledge graphs (KGs) by exploiting associated …
A symmetric dual encoding dense retrieval framework for knowledge-intensive visual question answering
Knowledge-Intensive Visual Question Answering (KI-VQA) refers to answering a question
about an image whose answer does not lie in the image. This paper presents a new pipeline …
about an image whose answer does not lie in the image. This paper presents a new pipeline …
[PDF][PDF] Structure-clip: Enhance multi-modal language representations with structure knowledge
Large-scale vision-language pre-training has shown promising advances on various
downstream tasks and achieved significant performance in multi-modal understanding and …
downstream tasks and achieved significant performance in multi-modal understanding and …
Structure-clip: Towards scene graph knowledge to enhance multi-modal structured representations
Large-scale vision-language pre-training has achieved significant performance in multi-
modal understanding and generation tasks. However, existing methods often perform poorly …
modal understanding and generation tasks. However, existing methods often perform poorly …
Tele-knowledge pre-training for fault analysis
In this work, we share our experience on tele-knowledge pre-training for fault analysis, a
crucial task in telecommunication applications that requires a wide range of knowledge …
crucial task in telecommunication applications that requires a wide range of knowledge …