Crisscrossed captions: Extended intramodal and intermodal semantic similarity judgments for MS-COCO

Z Parekh, J Baldridge, D Cer, A Waters… - arXiv preprint arXiv …, 2020 - arxiv.org
By supporting multi-modal retrieval training and evaluation, image captioning datasets have
spurred remarkable progress on representation learning. Unfortunately, datasets have …

State-of-the-art in language technology and language-centric Artificial Intelligence

R Agerri, E Agirre, I Aldabe, N Aranberri… - … Language Equality: A …, 2023 - Springer
This chapter landscapes the field of Language Technology (LT) and language-centric AI by
assembling a comprehensive state-of-the-art of basic and applied research in the area. It …

Comparative Study and Evaluation of Machine Learning Models for Semantic Textual Similarity

WH Sasoko, A Setyanto… - 2024 8th International …, 2024 - ieeexplore.ieee.org
Semantic Textual Similarity (STS) plays a critical role in various natural language processing
(NLP) applications such as information retrieval, text summarization, and machine …

Bilingual Multimodal Graph Modeling for Text-Image Relation Inference

D Zhang, W Lu, S Li, G Zhou - International Conference on Database …, 2024 - Springer
Text-Image relation inference (TIRI) aims to identify the potential semantic relationships
between text and image. Although previous works have made some progress, there are still …