Deep multi-view learning methods: A review

X Yan, S Hu, Y Mao, Y Ye, H Yu - Neurocomputing, 2021 - Elsevier
Multi-view learning (MVL) has attracted increasing attention and achieved great practical
success by exploiting complementary information of multiple features or modalities …

Multimodal machine learning: A survey and taxonomy

T Baltrušaitis, C Ahuja… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
Our experience of the world is multimodal-we see objects, hear sounds, feel texture, smell
odors, and taste flavors. Modality refers to the way in which something happens or is …

A decade survey of content based image retrieval using deep learning

SR Dubey - IEEE Transactions on Circuits and Systems for …, 2021 - ieeexplore.ieee.org
The content based image retrieval aims to find the similar images from a large scale dataset
against a query image. Generally, the similarity between the representative features of the …

Fine-grained video-text retrieval with hierarchical graph reasoning

S Chen, Y Zhao, Q **, Q Wu - Proceedings of the IEEE/CVF …, 2020 - openaccess.thecvf.com
Cross-modal retrieval between videos and texts has attracted growing attentions due to the
rapid emergence of videos on the web. The current dominant approach is to learn a joint …

Deep multimodal representation learning: A survey

W Guo, J Wang, S Wang - Ieee Access, 2019 - ieeexplore.ieee.org
Multimodal representation learning, which aims to narrow the heterogeneity gap among
different modalities, plays an indispensable role in the utilization of ubiquitous multimodal …

Hit: Hierarchical transformer with momentum contrast for video-text retrieval

S Liu, H Fan, S Qian, Y Chen… - Proceedings of the …, 2021 - openaccess.thecvf.com
Abstract Video-Text Retrieval has been a hot research topic with the growth of multimedia
data on the internet. Transformer for video-text learning has attracted increasing attention …

Deep supervised cross-modal retrieval

L Zhen, P Hu, X Wang, D Peng - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Cross-modal retrieval aims to enable flexible retrieval across different modalities. The core
of cross-modal retrieval is how to measure the content similarity between different types of …

A survey of deep neural network architectures and their applications

W Liu, Z Wang, X Liu, N Zeng, Y Liu, FE Alsaadi - Neurocomputing, 2017 - Elsevier
Since the proposal of a fast learning algorithm for deep belief networks in 2006, the deep
learning techniques have drawn ever-increasing research interests because of their …

Adversarial cross-modal retrieval

B Wang, Y Yang, X Xu, A Hanjalic… - Proceedings of the 25th …, 2017 - dl.acm.org
Cross-modal retrieval aims to enable flexible retrieval experience across different modalities
(eg, texts vs. images). The core of cross-modal retrieval research is to learn a common …

Multimodal co-learning: Challenges, applications with datasets, recent advances and future directions

A Rahate, R Walambe, S Ramanna, K Kotecha - Information Fusion, 2022 - Elsevier
Multimodal deep learning systems that employ multiple modalities like text, image, audio,
video, etc., are showing better performance than individual modalities (ie, unimodal) …