Deep fuzzy hashing network for efficient image retrieval

H Lu, M Zhang, X Xu, Y Li… - IEEE transactions on fuzzy …, 2020 - ieeexplore.ieee.org
Hashing methods for efficient image retrieval aim at learning hash functions that map similar
images to semantically correlated binary codes in the Hamming space with similarity well …

Exploiting subspace relation in semantic labels for cross-modal hashing

HT Shen, L Liu, Y Yang, X Xu, Z Huang… - … on Knowledge and …, 2020 - ieeexplore.ieee.org
Hashing methods have been extensively applied to efficient multimedia data indexing and
retrieval on account of the explosion of multimedia data. Cross-modal hashing usually …

Aggregation-based graph convolutional hashing for unsupervised cross-modal retrieval

PF Zhang, Y Li, Z Huang, XS Xu - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Cross-modal hashing has sparked much attention in large-scale information retrieval for its
storage and query efficiency. Despite the great success achieved by supervised …

Describing video with attention-based bidirectional LSTM

Y Bin, Y Yang, F Shen, N Xie… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
Video captioning has been attracting broad research attention in the multimedia community.
However, most existing approaches heavily rely on static visual information or partially …

Cycle-consistent deep generative hashing for cross-modal retrieval

L Wu, Y Wang, L Shao - IEEE Transactions on Image …, 2018 - ieeexplore.ieee.org
In this paper, we propose a novel deep generative approach to cross-modal retrieval to
learn hash functions in the absence of paired training samples through the cycle consistency …

Video captioning by adversarial LSTM

Y Yang, J Zhou, J Ai, Y Bin, A Hanjalic… - … on Image Processing, 2018 - ieeexplore.ieee.org
In this paper, we propose a novel approach to video captioning based on adversarial
learning and long short-term memory (LSTM). With this solution concept, we aim at …

Video coding optimization for virtual reality 360-degree source

Y Zhou, L Tian, C Zhu, X Jin… - IEEE Journal of Selected …, 2019 - ieeexplore.ieee.org
To provide excellent visual experience for customers, virtual reality (VR) sources require
higher resolutions and better visual quality than traditional picture sequences. The content of …

Toward effective intrusion detection using log-cosh conditional variational autoencoder

X Xu, J Li, Y Yang, F Shen - IEEE Internet of Things Journal, 2020 - ieeexplore.ieee.org
Intrusion detection is an important technique that can provide solid protection for the network
equipment against the security attacks. However, the attacks are usually unbalanced in …

MRA-Net: Improving VQA via multi-modal relation attention network

L Peng, Y Yang, Z Wang, Z Huang… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Visual Question Answering (VQA) is a task to answer natural language questions tied to the
content of visual images. Most recent VQA approaches usually apply attention mechanism to …

Discrete multimodal hashing with canonical views for robust mobile landmark search

L Zhu, Z Huang, X Liu, X He, J Sun… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
Mobile landmark search (MLS) recently receives increasing attention for its great practical
values. However, it still remains unsolved due to two important challenges. One is high …