Deep fuzzy hashing network for efficient image retrieval
Hashing methods for efficient image retrieval aim at learning hash functions that map similar
images to semantically correlated binary codes in the Hamming space with similarity well …
images to semantically correlated binary codes in the Hamming space with similarity well …
Exploiting subspace relation in semantic labels for cross-modal hashing
Hashing methods have been extensively applied to efficient multimedia data indexing and
retrieval on account of the explosion of multimedia data. Cross-modal hashing usually …
retrieval on account of the explosion of multimedia data. Cross-modal hashing usually …
Aggregation-based graph convolutional hashing for unsupervised cross-modal retrieval
Cross-modal hashing has sparked much attention in large-scale information retrieval for its
storage and query efficiency. Despite the great success achieved by supervised …
storage and query efficiency. Despite the great success achieved by supervised …
Describing video with attention-based bidirectional LSTM
Video captioning has been attracting broad research attention in the multimedia community.
However, most existing approaches heavily rely on static visual information or partially …
However, most existing approaches heavily rely on static visual information or partially …
Cycle-consistent deep generative hashing for cross-modal retrieval
In this paper, we propose a novel deep generative approach to cross-modal retrieval to
learn hash functions in the absence of paired training samples through the cycle consistency …
learn hash functions in the absence of paired training samples through the cycle consistency …
Video captioning by adversarial LSTM
In this paper, we propose a novel approach to video captioning based on adversarial
learning and long short-term memory (LSTM). With this solution concept, we aim at …
learning and long short-term memory (LSTM). With this solution concept, we aim at …
Video coding optimization for virtual reality 360-degree source
To provide excellent visual experience for customers, virtual reality (VR) sources require
higher resolutions and better visual quality than traditional picture sequences. The content of …
higher resolutions and better visual quality than traditional picture sequences. The content of …
Toward effective intrusion detection using log-cosh conditional variational autoencoder
Intrusion detection is an important technique that can provide solid protection for the network
equipment against the security attacks. However, the attacks are usually unbalanced in …
equipment against the security attacks. However, the attacks are usually unbalanced in …
MRA-Net: Improving VQA via multi-modal relation attention network
Visual Question Answering (VQA) is a task to answer natural language questions tied to the
content of visual images. Most recent VQA approaches usually apply attention mechanism to …
content of visual images. Most recent VQA approaches usually apply attention mechanism to …
Discrete multimodal hashing with canonical views for robust mobile landmark search
Mobile landmark search (MLS) recently receives increasing attention for its great practical
values. However, it still remains unsolved due to two important challenges. One is high …
values. However, it still remains unsolved due to two important challenges. One is high …