An overview of cross-media retrieval: Concepts, methodologies, benchmarks, and challenges

Y Peng, X Huang, Y Zhao - … on circuits and systems for video …, 2017 - ieeexplore.ieee.org
Multimedia retrieval plays an indispensable role in big data utilization. Past efforts mainly
focused on single-media retrieval. However, the requirements of users are highly flexible …

Cross-media analysis and reasoning: advances and directions

Y Peng, W Zhu, Y Zhao, C Xu, Q Huang, H Lu… - Frontiers of Information …, 2017 - Springer
Cross-media analysis and reasoning is an active research area in computer science, and a
promising direction for artificial intelligence. However, to the best of our knowledge, no …

[HTML][HTML] Heading toward artificial intelligence 2.0

Y Pan - Engineering, 2016 - Elsevier
With the popularization of the Internet, permeation of sensor networks, emergence of big
data, increase in size of the information community, and interlinking and fusion of data and …

CM-GANs: Cross-modal generative adversarial networks for common representation learning

Y Peng, J Qi - ACM Transactions on Multimedia Computing …, 2019 - dl.acm.org
It is known that the inconsistent distributions and representations of different modalities, such
as image and text, cause the heterogeneity gap, which makes it very challenging to correlate …

Cross-modal retrieval with CNN visual features: A new baseline

Y Wei, Y Zhao, C Lu, S Wei, L Liu… - IEEE transactions on …, 2016 - ieeexplore.ieee.org
Recently, convolutional neural network (CNN) visual features have demonstrated their
powerful ability as a universal representation for various recognition tasks. In this paper …

Align and tell: Boosting text-video retrieval with local alignment and fine-grained supervision

X Wang, L Zhu, Z Zheng, M Xu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Text-video retrieval is one of the basic tasks for multimodal research and has been widely
harnessed in many real-world systems. Most existing approaches directly compare the …

Inter-media hashing for large-scale retrieval from heterogeneous data sources

J Song, Y Yang, Y Yang, Z Huang… - Proceedings of the 2013 …, 2013 - dl.acm.org
In this paper, we present a new multimedia retrieval paradigm to innovate large-scale
search of heterogenous multimedia data. It is able to return results of different media types …

Robust joint graph sparse coding for unsupervised spectral feature selection

X Zhu, X Li, S Zhang, C Ju, X Wu - IEEE transactions on neural …, 2016 - ieeexplore.ieee.org
In this paper, we propose a new unsupervised spectral feature selection model by
embedding a graph regularizer into the framework of joint sparse regression for preserving …

On the role of correlation and abstraction in cross-modal multimedia retrieval

JC Pereira, E Coviello, G Doyle… - IEEE transactions on …, 2013 - ieeexplore.ieee.org
The problem of cross-modal retrieval from multimedia repositories is considered. This
problem addresses the design of retrieval systems that support queries across content …