A systematic literature review on multimodal machine learning: Applications, challenges, gaps and future directions
Multimodal machine learning (MML) is a tempting multidisciplinary research area where
heterogeneous data from multiple modalities and machine learning (ML) are combined to …
heterogeneous data from multiple modalities and machine learning (ML) are combined to …
A decade survey of content based image retrieval using deep learning
SR Dubey - IEEE Transactions on Circuits and Systems for …, 2021 - ieeexplore.ieee.org
The content based image retrieval aims to find the similar images from a large scale dataset
against a query image. Generally, the similarity between the representative features of the …
against a query image. Generally, the similarity between the representative features of the …
Deep multi-view enhancement hashing for image retrieval
Hashing is an efficient method for nearest neighbor search in large-scale data space by
embedding high-dimensional feature descriptors into a similarity preserving Hamming …
embedding high-dimensional feature descriptors into a similarity preserving Hamming …
Joint-modal distribution-based similarity hashing for large-scale unsupervised deep cross-modal retrieval
Hashing-based cross-modal search which aims to map multiple modality features into binary
codes has attracted increasingly attention due to its storage and search efficiency especially …
codes has attracted increasingly attention due to its storage and search efficiency especially …
Modality-invariant asymmetric networks for cross-modal hashing
Cross-modal hashing has garnered considerable attention and gained great success in
many cross-media similarity search applications due to its prominent computational …
many cross-media similarity search applications due to its prominent computational …
A deep semantic alignment network for the cross-modal image-text retrieval in remote sensing
Q Cheng, Y Zhou, P Fu, Y Xu… - IEEE Journal of Selected …, 2021 - ieeexplore.ieee.org
Because of the rapid growth of multimodal data from the internet and social media, a cross-
modal retrieval has become an important and valuable task in recent years. The purpose of …
modal retrieval has become an important and valuable task in recent years. The purpose of …
A survey on deep hashing methods
Nearest neighbor search aims at obtaining the samples in the database with the smallest
distances from them to the queries, which is a basic task in a range of fields, including …
distances from them to the queries, which is a basic task in a range of fields, including …
[HTML][HTML] New ideas and trends in deep multimodal content understanding: A review
The focus of this survey is on the analysis of two modalities of multimodal deep learning:
image and text. Unlike classic reviews of deep learning where monomodal image classifiers …
image and text. Unlike classic reviews of deep learning where monomodal image classifiers …
Flexible body partition-based adversarial learning for visible infrared person re-identification
Person re-identification (Re-ID) aims to retrieve images of the same person across disjoint
camera views. Most Re-ID studies focus on pedestrian images captured by visible cameras …
camera views. Most Re-ID studies focus on pedestrian images captured by visible cameras …
Deep discrete cross-modal hashing with multiple supervision
Deep hashing has been widely used for large-scale cross-modal retrieval benefited from the
low storage cost and fast search speed. However, most existing deep supervised methods …
low storage cost and fast search speed. However, most existing deep supervised methods …