Multimodal machine learning: A survey and taxonomy

T Baltrušaitis, C Ahuja… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
Our experience of the world is multimodal-we see objects, hear sounds, feel texture, smell
odors, and taste flavors. Modality refers to the way in which something happens or is …

Multi-modal machine learning in engineering design: A review and future directions

B Song, R Zhou, F Ahmed - … of Computing and …, 2024 - asmedigitalcollection.asme.org
In the rapidly advancing field of multi-modal machine learning (MMML), the convergence of
multiple data modalities has the potential to reshape various applications. This paper …

A decade survey of content based image retrieval using deep learning

SR Dubey - IEEE Transactions on Circuits and Systems for …, 2021 - ieeexplore.ieee.org
The content based image retrieval aims to find the similar images from a large scale dataset
against a query image. Generally, the similarity between the representative features of the …

Deep supervised cross-modal retrieval

L Zhen, P Hu, X Wang, D Peng - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Cross-modal retrieval aims to enable flexible retrieval across different modalities. The core
of cross-modal retrieval is how to measure the content similarity between different types of …

Deep hashing network for unsupervised domain adaptation

H Venkateswara, J Eusebio… - Proceedings of the …, 2017 - openaccess.thecvf.com
In recent years, deep neural networks have emerged as a dominant machine learning tool
for a wide variety of application domains. However, training a deep neural network requires …

Self-supervised adversarial hashing networks for cross-modal retrieval

C Li, C Deng, N Li, W Liu, X Gao… - Proceedings of the …, 2018 - openaccess.thecvf.com
Thanks to the success of deep learning, cross-modal retrieval has made significant progress
recently. However, there still remains a crucial bottleneck: how to bridge the modality gap to …

Deep joint-semantics reconstructing hashing for large-scale unsupervised cross-modal retrieval

S Su, Z Zhong, C Zhang - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
Cross-modal hashing encodes the multimedia data into a common binary hash space in
which the correlations among the samples from different modalities can be effectively …

Deep cauchy hashing for hamming space retrieval

Y Cao, M Long, B Liu, J Wang - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
Due to its computation efficiency and retrieval quality, hashing has been widely applied to
approximate nearest neighbor search for large-scale image retrieval, while deep hashing …

Deep cross-modal hashing

QY Jiang, WJ Li - Proceedings of the IEEE conference on …, 2017 - openaccess.thecvf.com
Due to its low storage cost and fast query speed, cross-modal hashing (CMH) has been
widely used for similarity search in multimedia retrieval applications. However, most existing …

Adversarial representation learning for text-to-image matching

N Sarafianos, X Xu… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
For many computer vision applications such as image captioning, visual question
answering, and person search, learning discriminative feature representations at both image …