A comprehensive survey on cross-modal retrieval

K Wang, Q Yin, W Wang, S Wu, L Wang - arxiv preprint arxiv:1607.06215, 2016 - arxiv.org
In recent years, cross-modal retrieval has drawn much attention due to the rapid growth of
multimodal data. It takes one type of data as the query to retrieve relevant data of another …

Cross-modal retrieval: a systematic review of methods and future directions

T Wang, F Li, L Zhu, J Li, Z Zhang… - Proceedings of the …, 2025 - ieeexplore.ieee.org
With the exponential surge in diverse multimodal data, traditional unimodal retrieval
methods struggle to meet the needs of users seeking access to data across various …

Deep supervised cross-modal retrieval

L Zhen, P Hu, X Wang, D Peng - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Cross-modal retrieval aims to enable flexible retrieval across different modalities. The core
of cross-modal retrieval is how to measure the content similarity between different types of …

Self-constraining and attention-based hashing network for bit-scalable cross-modal retrieval

X Wang, X Zou, EM Bakker, S Wu - Neurocomputing, 2020 - Elsevier
Recently deep cross-modal hashing (CMH) have received increased attention in multimedia
information retrieval, as it is able to combine the benefit from the low storage cost and search …

Multi-label modality enhanced attention based self-supervised deep cross-modal hashing

X Zou, S Wu, N Zhang, EM Bakker - Knowledge-Based Systems, 2022 - Elsevier
The recent deep cross-modal hashing (DCMH) has achieved superior performance in
effective and efficient cross-modal retrieval and thus has drawn increasing attention …

Deep triplet neural networks with cluster-cca for audio-visual cross-modal retrieval

D Zeng, Y Yu, K Oyama - ACM Transactions on Multimedia Computing …, 2020 - dl.acm.org
Cross-modal retrieval aims to retrieve data in one modality by a query in another modality,
which has been a very interesting research issue in the field of multimedia, information …

Cross-modal subspace learning for fine-grained sketch-based image retrieval

P Xu, Q Yin, Y Huang, YZ Song, Z Ma, L Wang, T **ang… - Neurocomputing, 2018 - Elsevier
Sketch-based image retrieval (SBIR) is challenging due to the inherent domain-gap
between sketch and photo. Compared with pixel-perfect depictions of photos, sketches are …

Simple to complex cross-modal learning to rank

M Luo, X Chang, Z Li, L Nie, AG Hauptmann… - Computer Vision and …, 2017 - Elsevier
The heterogeneity-gap between different modalities brings a significant challenge to
multimedia information retrieval. Some studies formalize the cross-modal retrieval tasks as a …

Multi-label enhancement based self-supervised deep cross-modal hashing

X Zou, S Wu, EM Bakker, X Wang - Neurocomputing, 2022 - Elsevier
Deep cross-modal hashing which integrates deep learning and hashing into cross-modal
retrieval, achieves better performance than traditional cross-modal retrieval methods …

Twitter100k: A real-world dataset for weakly supervised cross-media retrieval

Y Hu, L Zheng, Y Yang, Y Huang - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
This paper contributes a new large-scale dataset for weakly supervised cross-media
retrieval, named Twitter100k. Current datasets, such as Wikipedia, NUS Wide, and Flickr30k …