An overview of cross-media retrieval: Concepts, methodologies, benchmarks, and challenges

Y Peng, X Huang, Y Zhao - … on circuits and systems for video …, 2017 - ieeexplore.ieee.org
Multimedia retrieval plays an indispensable role in big data utilization. Past efforts mainly
focused on single-media retrieval. However, the requirements of users are highly flexible …

A comprehensive survey on cross-modal retrieval

K Wang, Q Yin, W Wang, S Wu, L Wang - arxiv preprint arxiv:1607.06215, 2016 - arxiv.org
In recent years, cross-modal retrieval has drawn much attention due to the rapid growth of
multimodal data. It takes one type of data as the query to retrieve relevant data of another …

[PDF][PDF] Theoretical Analysis of the Brain and Artificial Intelligence

F Pedro - Journal of Robotics Spectrum, 2023 - anapub.co.ke
Many articles have expounded on and defended the potential advantages of co-robotics
(cobots), robotics, AI, and quantum computers in the domains of research and development …

Dual encoding for video retrieval by text

J Dong, X Li, C Xu, X Yang, G Yang… - … on Pattern Analysis …, 2021 - ieeexplore.ieee.org
This paper attacks the challenging problem of video retrieval by text. In such a retrieval
paradigm, an end user searches for unlabeled videos by ad-hoc queries described …

Deep supervised cross-modal retrieval

L Zhen, P Hu, X Wang, D Peng - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Cross-modal retrieval aims to enable flexible retrieval across different modalities. The core
of cross-modal retrieval is how to measure the content similarity between different types of …

Adversarial cross-modal retrieval

B Wang, Y Yang, X Xu, A Hanjalic… - Proceedings of the 25th …, 2017 - dl.acm.org
Cross-modal retrieval aims to enable flexible retrieval experience across different modalities
(eg, texts vs. images). The core of cross-modal retrieval research is to learn a common …

Deep collaborative embedding for social image understanding

Z Li, J Tang, T Mei - IEEE transactions on pattern analysis and …, 2018 - ieeexplore.ieee.org
In this work, we investigate the problem of learning knowledge from the massive community-
contributed images with rich weakly-supervised context information, which can benefit …

Deep joint-semantics reconstructing hashing for large-scale unsupervised cross-modal retrieval

S Su, Z Zhong, C Zhang - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
Cross-modal hashing encodes the multimedia data into a common binary hash space in
which the correlations among the samples from different modalities can be effectively …

A survey of multi-view representation learning

Y Li, M Yang, Z Zhang - IEEE transactions on knowledge and …, 2018 - ieeexplore.ieee.org
Recently, multi-view representation learning has become a rapidly growing direction in
machine learning and data mining areas. This paper introduces two categories for multi …

Survey on deep multi-modal data analytics: Collaboration, rivalry, and fusion

Y Wang - ACM Transactions on Multimedia Computing …, 2021 - dl.acm.org
With the development of web technology, multi-modal or multi-view data has surged as a
major stream for big data, where each modal/view encodes individual property of data …