Multimodal machine learning: A survey and taxonomy

T Baltrušaitis, C Ahuja… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
Our experience of the world is multimodal-we see objects, hear sounds, feel texture, smell
odors, and taste flavors. Modality refers to the way in which something happens or is …

A survey on deep learning for multimodal data fusion

J Gao, P Li, Z Chen, J Zhang - Neural Computation, 2020 - direct.mit.edu
With the wide deployments of heterogeneous networks, huge amounts of data with
characteristics of high volume, high variety, high velocity, and high veracity are generated …

Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval

Y Gong, S Lazebnik, A Gordo… - IEEE transactions on …, 2012 - ieeexplore.ieee.org
This paper addresses the problem of learning similarity-preserving binary codes for efficient
similarity search in large-scale image collections. We formulate this problem in terms of …

A systematic literature review on crop yield prediction with deep learning and remote sensing

P Muruganantham, S Wibowo, S Grandhi, NH Samrat… - Remote Sensing, 2022 - mdpi.com
Deep learning has emerged as a potential tool for crop yield prediction, allowing the model
to automatically extract features and learn from the datasets. Meanwhile, smart farming …

Deformable medical image registration: A survey

A Sotiras, C Davatzikos… - IEEE transactions on …, 2013 - ieeexplore.ieee.org
Deformable image registration is a fundamental task in medical image processing. Among
its most important applications, one may cite: 1) multi-modality fusion, where information …

Deep cross-modal hashing

QY Jiang, WJ Li - Proceedings of the IEEE conference on …, 2017 - openaccess.thecvf.com
Due to its low storage cost and fast query speed, cross-modal hashing (CMH) has been
widely used for similarity search in multimedia retrieval applications. However, most existing …

Self-supervised adversarial hashing networks for cross-modal retrieval

C Li, C Deng, N Li, W Liu, X Gao… - Proceedings of the …, 2018 - openaccess.thecvf.com
Thanks to the success of deep learning, cross-modal retrieval has made significant progress
recently. However, there still remains a crucial bottleneck: how to bridge the modality gap to …

Dolg: Single-stage image retrieval with deep orthogonal fusion of local and global features

M Yang, D He, M Fan, B Shi, X Xue… - Proceedings of the …, 2021 - openaccess.thecvf.com
Image Retrieval is a fundamental task of obtaining images similar to the query one from a
database. A common image retrieval practice is to firstly retrieve candidate images via …

A survey of multi-view representation learning

Y Li, M Yang, Z Zhang - IEEE transactions on knowledge and …, 2018 - ieeexplore.ieee.org
Recently, multi-view representation learning has become a rapidly growing direction in
machine learning and data mining areas. This paper introduces two categories for multi …

Learning to hash for indexing big data—A survey

J Wang, W Liu, S Kumar, SF Chang - Proceedings of the IEEE, 2015 - ieeexplore.ieee.org
The explosive growth in Big Data has attracted much attention in designing efficient indexing
and search methods recently. In many critical applications such as large-scale search and …