A systematic literature review on multimodal machine learning: Applications, challenges, gaps and future directions

A Barua, MU Ahmed, S Begum - IEEE Access, 2023 - ieeexplore.ieee.org
Multimodal machine learning (MML) is a tempting multidisciplinary research area where
heterogeneous data from multiple modalities and machine learning (ML) are combined to …

A decade survey of content based image retrieval using deep learning

SR Dubey - IEEE Transactions on Circuits and Systems for …, 2021 - ieeexplore.ieee.org
The content based image retrieval aims to find the similar images from a large scale dataset
against a query image. Generally, the similarity between the representative features of the …

Deep multi-view enhancement hashing for image retrieval

C Yan, B Gong, Y Wei, Y Gao - IEEE Transactions on Pattern …, 2020 - ieeexplore.ieee.org
Hashing is an efficient method for nearest neighbor search in large-scale data space by
embedding high-dimensional feature descriptors into a similarity preserving Hamming …

Joint-modal distribution-based similarity hashing for large-scale unsupervised deep cross-modal retrieval

S Liu, S Qian, Y Guan, J Zhan, L Ying - Proceedings of the 43rd …, 2020 - dl.acm.org
Hashing-based cross-modal search which aims to map multiple modality features into binary
codes has attracted increasingly attention due to its storage and search efficiency especially …

Modality-invariant asymmetric networks for cross-modal hashing

Z Zhang, H Luo, L Zhu, G Lu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Cross-modal hashing has garnered considerable attention and gained great success in
many cross-media similarity search applications due to its prominent computational …

A deep semantic alignment network for the cross-modal image-text retrieval in remote sensing

Q Cheng, Y Zhou, P Fu, Y Xu… - IEEE Journal of Selected …, 2021 - ieeexplore.ieee.org
Because of the rapid growth of multimodal data from the internet and social media, a cross-
modal retrieval has become an important and valuable task in recent years. The purpose of …

A survey on deep hashing methods

X Luo, H Wang, D Wu, C Chen, M Deng… - ACM Transactions on …, 2023 - dl.acm.org
Nearest neighbor search aims at obtaining the samples in the database with the smallest
distances from them to the queries, which is a basic task in a range of fields, including …

[HTML][HTML] New ideas and trends in deep multimodal content understanding: A review

W Chen, W Wang, L Liu, MS Lew - Neurocomputing, 2021 - Elsevier
The focus of this survey is on the analysis of two modalities of multimodal deep learning:
image and text. Unlike classic reviews of deep learning where monomodal image classifiers …

Flexible body partition-based adversarial learning for visible infrared person re-identification

Z Wei, X Yang, N Wang, X Gao - IEEE Transactions on Neural …, 2021 - ieeexplore.ieee.org
Person re-identification (Re-ID) aims to retrieve images of the same person across disjoint
camera views. Most Re-ID studies focus on pedestrian images captured by visible cameras …

Deep discrete cross-modal hashing with multiple supervision

E Yu, J Ma, J Sun, X Chang, H Zhang, AG Hauptmann - Neurocomputing, 2022 - Elsevier
Deep hashing has been widely used for large-scale cross-modal retrieval benefited from the
low storage cost and fast search speed. However, most existing deep supervised methods …