الباحث العلمي من Google

FL Chen, DZ Zhang, ML Han, XY Chen, J Shi… - Machine Intelligence …, 2023‏ - Springer‏

In the past few years, the emergence of pre-training models has brought uni-modal fields
such as computer vision (CV) and natural language processing (NLP) to a new era …‏

حفظ اقتباس تم اقتباسها في عدد: 218 مقالات ذات صلة الإصدارات الـ 10كلها

[Free GPT-4]

[PDF] arxiv.org

Image-text retrieval: A survey on recent research and development‏

M Cao, S Li, J Li, L Nie, M Zhang - arxiv preprint arxiv:2203.14713, 2022‏ - arxiv.org‏

In the past few years, cross-modal image-text retrieval (ITR) has experienced increased
interest in the research community due to its excellent research value and broad real-world …‏

حفظ اقتباس تم اقتباسها في عدد: 103 مقالات ذات صلة الإصدارات الـ 4كلها إصدار HTML‏

[Free GPT-4]

[PDF] thecvf.com

Gloria: A multimodal global-local representation learning framework for label-efficient medical image recognition‏

SC Huang, L Shen, MP Lungren… - Proceedings of the …, 2021‏ - openaccess.thecvf.com‏

In recent years, the growing number of medical imaging studies is placing an ever-
increasing burden on radiologists. Deep learning provides a promising solution for …‏

حفظ اقتباس تم اقتباسها في عدد: 341 مقالات ذات صلة الإصدارات الـ 6كلها إصدار HTML‏

[Free GPT-4]

[PDF] thecvf.com

Negative-aware attention framework for image-text matching‏

K Zhang, Z Mao, Q Wang… - Proceedings of the IEEE …, 2022‏ - openaccess.thecvf.com‏

Image-text matching, as a fundamental task, bridges the gap between vision and language.
The key of this task is to accurately measure similarity between these two modalities. Prior …‏

حفظ اقتباس تم اقتباسها في عدد: 147 مقالات ذات صلة الإصدارات الـ 4كلها إصدار HTML‏

[Free GPT-4]

[PDF] nature.com

Towards artificial general intelligence via a multimodal foundation model‏

N Fei, Z Lu, Y Gao, G Yang, Y Huo, J Wen, H Lu… - Nature …, 2022‏ - nature.com‏

The fundamental goal of artificial intelligence (AI) is to mimic the core cognitive activities of
human. Despite tremendous success in the AI research, most of existing methods have only …‏

حفظ اقتباس تم اقتباسها في عدد: 237 مقالات ذات صلة الإصدارات الـ 15كلها

[Free GPT-4]

[PDF] aaai.org

Similarity reasoning and filtration for image-text matching‏

H Diao, Y Zhang, L Ma, H Lu - Proceedings of the AAAI conference on …, 2021‏ - ojs.aaai.org‏

Image-text matching plays a critical role in bridging the vision and language, and great
progress has been made by exploiting the global alignment between image and sentence …‏

حفظ اقتباس تم اقتباسها في عدد: 359 مقالات ذات صلة الإصدارات الـ 9كلها إصدار HTML‏

Dual-level representation enhancement on characteristic and context for image-text retrieval‏

S Yang, Q Li, W Li, X Li, AA Liu - IEEE Transactions on Circuits …, 2022‏ - ieeexplore.ieee.org‏

Image-text retrieval is a fundamental and vital task in multi-media retrieval and has received
growing attention since it connects heterogeneous data. Previous methods that perform well …‏

حفظ اقتباس تم اقتباسها في عدد: 106 مقالات ذات صلة الإصدارات الـ 2كلها

[Free GPT-4]

[PDF] thecvf.com

Learning the best pooling strategy for visual semantic embedding‏

J Chen, H Hu, H Wu, Y Jiang… - Proceedings of the IEEE …, 2021‏ - openaccess.thecvf.com‏

Abstract Visual Semantic Embedding (VSE) is a dominant approach for vision-language
retrieval, which aims at learning a deep embedding space such that visual data are …‏

حفظ اقتباس تم اقتباسها في عدد: 271 مقالات ذات صلة الإصدارات الـ 9كلها إصدار HTML‏

[Free GPT-4]

[PDF] aber.ac.uk

Region-object relation-aware dense captioning via transformer‏

Z Shao, J Han, D Marnerides… - IEEE Transactions on …, 2022‏ - ieeexplore.ieee.org‏

Dense captioning provides detailed captions of complex visual scenes. While a number of
successes have been achieved in recent years, there are still two broad limitations: 1) most …‏

حفظ اقتباس تم اقتباسها في عدد: 136 مقالات ذات صلة الإصدارات الـ 9كلها

[Free GPT-4]

[PDF] researchgate.net

Dynamic modality interaction modeling for image-text retrieval‏

L Qu, M Liu, J Wu, Z Gao, L Nie - … of the 44th International ACM SIGIR …, 2021‏ - dl.acm.org‏

Image-text retrieval is a fundamental and crucial branch in information retrieval. Although
much progress has been made in bridging vision and language, it remains challenging …‏

حفظ اقتباس تم اقتباسها في عدد: 166 مقالات ذات صلة الإصدارات الـ 3كلها

إنشاء تنبيه

اقتباس

بحث متقدم

تم حفظ المقالة في مكتبتي.

Imram: Iterative matching with recurrent attention memory for cross-modal image-text retrieval

Vlp: A survey on vision-language pre-training‏

Image-text retrieval: A survey on recent research and development‏

Gloria: A multimodal global-local representation learning framework for label-efficient medical image recognition‏

Negative-aware attention framework for image-text matching‏

Towards artificial general intelligence via a multimodal foundation model‏

Similarity reasoning and filtration for image-text matching‏

Dual-level representation enhancement on characteristic and context for image-text retrieval‏

Learning the best pooling strategy for visual semantic embedding‏

Region-object relation-aware dense captioning via transformer‏

Dynamic modality interaction modeling for image-text retrieval‏