Cross-modal retrieval: a systematic review of methods and future directions
With the exponential surge in diverse multimodal data, traditional unimodal retrieval
methods struggle to meet the needs of users seeking access to data across various …
methods struggle to meet the needs of users seeking access to data across various …
Comparative analysis on cross-modal information retrieval: A review
Human beings experience life through a spectrum of modes such as vision, taste, hearing,
smell, and touch. These multiple modes are integrated for information processing in our …
smell, and touch. These multiple modes are integrated for information processing in our …
Simple unsupervised graph representation learning
In this paper, we propose a simple unsupervised graph representation learning method to
conduct effective and efficient contrastive learning. Specifically, the proposed multiplet loss …
conduct effective and efficient contrastive learning. Specifically, the proposed multiplet loss …
Deep fuzzy hashing network for efficient image retrieval
Hashing methods for efficient image retrieval aim at learning hash functions that map similar
images to semantically correlated binary codes in the Hamming space with similarity well …
images to semantically correlated binary codes in the Hamming space with similarity well …
Cross-modal attention with semantic consistence for image–text matching
The task of image-text matching refers to measuring the visual-semantic similarity between
an image and a sentence. Recently, the fine-grained matching methods that explore the …
an image and a sentence. Recently, the fine-grained matching methods that explore the …
Survey on deep multi-modal data analytics: Collaboration, rivalry, and fusion
Y Wang - ACM Transactions on Multimedia Computing …, 2021 - dl.acm.org
With the development of web technology, multi-modal or multi-view data has surged as a
major stream for big data, where each modal/view encodes individual property of data …
major stream for big data, where each modal/view encodes individual property of data …
Deep incomplete multi-view clustering with cross-view partial sample and prototype alignment
The success of existing multi-view clustering relies on the assumption of sample integrity
across multiple views. However, in real-world scenarios, samples of multi-view are partially …
across multiple views. However, in real-world scenarios, samples of multi-view are partially …
Exploiting subspace relation in semantic labels for cross-modal hashing
Hashing methods have been extensively applied to efficient multimedia data indexing and
retrieval on account of the explosion of multimedia data. Cross-modal hashing usually …
retrieval on account of the explosion of multimedia data. Cross-modal hashing usually …
Deep incomplete multi-view clustering via mining cluster complementarity
Incomplete multi-view clustering (IMVC) is an important unsupervised approach to group the
multi-view data containing missing data in some views. Previous IMVC methods suffer from …
multi-view data containing missing data in some views. Previous IMVC methods suffer from …
SCCGAN: style and characters inpainting based on CGAN
R Liu, X Wang, H Lu, Z Wu, Q Fan, S Li… - Mobile networks and …, 2021 - Springer
With the development of deep learning technology, many deep learning methods have been
applied to font recognition and generation. However, few studies focus on font inpainting …
applied to font recognition and generation. However, few studies focus on font inpainting …