Dual encoding for video retrieval by text

J Dong, X Li, C Xu, X Yang, G Yang… - … on Pattern Analysis …, 2021 - ieeexplore.ieee.org
This paper attacks the challenging problem of video retrieval by text. In such a retrieval
paradigm, an end user searches for unlabeled videos by ad-hoc queries described …

Dual encoding for zero-example video retrieval

J Dong, X Li, C Xu, S Ji, Y He… - Proceedings of the …, 2019 - openaccess.thecvf.com
This paper attacks the challenging problem of zero-example video retrieval. In such a
retrieval paradigm, an end user searches for unlabeled videos by ad-hoc queries described …

Interactive video retrieval in the age of effective joint embedding deep models: lessons from the 11th VBS

J Lokoč, S Andreadis, W Bailer, A Duane, C Gurrin… - Multimedia …, 2023 - Springer
This paper presents findings of the eleventh Video Browser Showdown competition, where
sixteen teams competed in known-item and ad-hoc search tasks. Many of the teams utilized …

Tree-augmented cross-modal encoding for complex-query video retrieval

X Yang, J Dong, Y Cao, X Wang, M Wang… - Proceedings of the 43rd …, 2020 - dl.acm.org
The rapid growth of user-generated videos on the Internet has intensified the need for
text-based video retrieval systems. Traditional methods mainly favor the concept-based …

W2VV++: Fully deep learning for ad-hoc video search

X Li, C Xu, G Yang, Z Chen, J Dong - Proceedings of the 27th ACM …, 2019 - dl.acm.org
Ad-hoc video search (AVS) is an important yet challenging problem in multimedia retrieval.
Different from previous concept-based methods, we propose a fully deep learning method …

Interactive video retrieval evaluation at a distance: comparing sixteen interactive video search systems in a remote setting at the 10th video browser showdown

S Heller, V Gsteiger, W Bailer, C Gurrin… - International Journal of …, 2022 - Springer
The Video Browser Showdown addresses difficult video search challenges through
an annual interactive evaluation campaign attracting research teams focusing on interactive …

A comprehensive review of the video-to-text problem

J Perez-Martin, B Bustos, SJF Guimarães… - Artificial Intelligence …, 2022 - Springer
Research in the Vision and Language area encompasses challenging topics that seek to
connect visual and textual information. When the visual information is related to videos, this …

Lightweight attentional feature fusion: A new baseline for text-to-video retrieval

F Hu, A Chen, Z Wang, F Zhou, J Dong, X Li - European conference on …, 2022 - Springer
In this paper we revisit feature fusion, an old-fashioned topic, in the new context of
text-to-video retrieval. Different from previous research that considers feature fusion only at one …

SEA: Sentence encoder assembly for video retrieval by textual queries

X Li, F Zhou, C Xu, J Ji, G Yang - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Retrieving unlabeled videos by textual queries, known as Ad-hoc Video Search (AVS), is a
core theme in multimedia data management and retrieval. The success of AVS counts on …

Comparison of fine-tuning and extension strategies for deep convolutional neural networks

N Pittaras, F Markatopoulou, V Mezaris… - MultiMedia Modeling: 23rd …, 2017 - Springer
In this study we compare three different fine-tuning strategies in order to investigate the best
way to transfer the parameters of popular deep convolutional neural networks that were …