Based on spatial and temporal implicit semantic relational inference for cross-modal retrieval

M **, W Hu, L Zhu, X Wang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
To meet users' demands for video retrieval, text-video cross-modal retrieval technology
continues to evolve. Methods based on pre-trained models and transfer learning are widely …

Advances in information retrieval collection on the European conference on information retrieval 2023

J Kamps, L Goeuriot, F Crestani - Discover Computing, 2024 - Springer
This paper introduces the Collection on ECIR 2023. The 45th European Conference on
Information Retrieval (ECIR 2023) was held in Dublin, Ireland, during April 2–6, 2023. The …

Enhancing Video-Language Alignment Via Mining Entity Knowledge

C Wang, X Dong, J Gu, Y Wen, R Hunag, J Han… - Available at SSRN … - papers.ssrn.com
Abstract Video-Language Alignment is a crucial task with applications in video-text retrieval
and captioning, involving aligning video content with textual descriptions. However …