Unifying knowledge iterative dissemination and relational reconstruction network for image–text matching
Image–text matching is a crucial branch in multimedia retrieval which relies on learning
inter-modal correspondences. Most existing methods focus on global or local correspondence …
Heterogeneous Graph Fusion Network for cross-modal image-text retrieval
X Qin, L Li, G Pang, F Hao - Expert Systems with Applications, 2024 - Elsevier
Exploring the semantic correspondence of image-text pairs is significant as it bridges vision
and language. Most prior works focus on global semantic alignment or local semantic …
Multi-level knowledge-driven feature representation and triplet loss optimization network for image–text retrieval
X Qin, L Li, F Hao, M Ge, G Pang - Information Processing & Management, 2024 - Elsevier
Image–text retrieval plays a considerable role in associating vision and language. Existing
mainstream approaches focus on fine-grained alignment while ignoring the influence of …
Bridging the cross-modality semantic gap in visual question answering
The objective of visual question answering (VQA) is to adequately comprehend a question
and identify relevant contents in an image that can provide an answer. Existing approaches …
Multi-level Symmetric Semantic Alignment Network for image–text matching
W Wang, X Di, M Liu, F Gao - Neurocomputing, 2024 - Elsevier
Image–text matching has attracted much attention as one of the visual-linguistic tasks. Most
of the existing methods tend to concentrate on single-level semantic similarity by global …
Multi-scale motivated neural network for image-text matching
X Qin, L Li, G Pang - Multimedia Tools and Applications, 2024 - Springer
Existing mainstream image-text matching methods usually measure the relevance of
image-text pairs by capturing and aggregating the affinities between textual words and visual …
Multi-task visual semantic embedding network for image-text retrieval
Image-text retrieval aims to capture the semantic correspondence between images and
texts, which serves as a foundation and crucial component in multi-modal recommendations …
Global-guided asymmetric attention network for image-text matching
D Wu, H Li, Y Tang, L Guo, H Liu - Neurocomputing, 2022 - Elsevier
Image-text matching is a vital yet challenging task in the field of vision and language. Unlike
previous methods that usually adopt a symmetrical network to independently embed images …
Cross-modal information balance-aware reasoning network for image-text retrieval
X Qin, L Li, F Hao, G Pang, Z Wang - Engineering Applications of Artificial …, 2023 - Elsevier
As a fundamental multimodal task, image-text retrieval bridges the gap between vision and
language. Current mainstream methods exploit attention mechanisms to discover potential …
Visual Contextual Semantic Reasoning for Cross-Modal Drone Image-Text Retrieval
The cross-modal drone image-text (DIT) retrieval task involves using either text or drone
images as queries to retrieve relevant drone images or corresponding text. The primary …