Cross-modal retrieval: a systematic review of methods and future directions

T Wang, F Li, L Zhu, J Li, Z Zhang… - Proceedings of the …, 2025 - ieeexplore.ieee.org
With the exponential surge in diverse multimodal data, traditional unimodal retrieval
methods struggle to meet the needs of users seeking access to data across various …

Prada: Practical black-box adversarial attacks against neural ranking models

C Wu, R Zhang, J Guo, M De Rijke, Y Fan… - ACM Transactions on …, 2023 - dl.acm.org
Neural ranking models (NRMs) have shown remarkable success in recent years, especially
with pre-trained language models. However, deep neural models are notorious for their …

Targeted adversarial attack against deep cross-modal hashing retrieval

T Wang, L Zhu, Z Zhang, H Zhang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Deep cross-modal hashing has achieved excellent retrieval performance with the powerful
representation capability of deep neural networks. Regrettably, current methods are …

Universal adversarial perturbations for vision-language pre-trained models

PF Zhang, Z Huang, G Bai - Proceedings of the 47th International ACM …, 2024 - dl.acm.org
Vision-language pre-trained (VLP) models have been the foundation of numerous vision-
language tasks. Given their prevalence, it becomes imperative to assess their adversarial …

Invisible black-box backdoor attack against deep cross-modal hashing retrieval

T Wang, F Li, L Zhu, J Li, Z Zhang… - ACM Transactions on …, 2024 - dl.acm.org
Deep cross-modal hashing has promoted the field of multi-modal retrieval due to its
excellent efficiency and storage, but its vulnerability to backdoor attacks is rarely studied …

Once and for all: Universal transferable adversarial perturbation against deep hashing-based facial image retrieval

L Tang, D Ye, Y Lv, C Chen, Y Zhang - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Deep Hashing (DH) based image retrieval has been widely applied to face-matching
systems due to its accuracy and efficiency. However, this convenience comes with an …

Hypergraph-enhanced hashing for unsupervised cross-modal retrieval via robust similarity guidance

F Zhong, C Chu, Z Zhu, Z Chen - Proceedings of the 31st ACM …, 2023 - dl.acm.org
Unsupervised cross-modal hashing retrieval across image and text modality is a challenging
task because of the suboptimality of similarity guidance, ie, the joint similarity matrix …

Multi-layer Probabilistic Association Reasoning Network for Image-Text Retrieval

W Li, R **ong, X Fan - … on Circuits and Systems for Video …, 2024 - ieeexplore.ieee.org
With the advancement of deep learning, the task of image-text retrieval has received
widespread attention for addressing the semantic heterogeneity in multimodal data …

A privacy-preserving cross-media retrieval on encrypted data in cloud computing

Z Wang, J Qin, X **ang, Y Tan, J Peng - Journal of Information Security and …, 2023 - Elsevier
Frequent cloud data breaches cause irreparable damage to cloud users and providers.
Cross-media retrieval can better leverage the value of data, but existing cross-media …

Mitigating Cross-modal Retrieval Violations with Privacy-preserving Backdoor Learning

Q Liu, Y Qiu, T Zhou, M Xu, J Qin, W Ma… - … on Circuits and …, 2024 - ieeexplore.ieee.org
Deep cross-modal retrieval, with its effective and efficient search capabilities, has gained
widespread adoption in today's media-sharing practices yet raises concerns regarding …