Pose-normalized image generation for person re-identification
Person Re-identification (re-id) faces two major challenges: the lack of cross-view paired
training data and learning discriminative identity-sensitive and view-invariant features in the …
training data and learning discriminative identity-sensitive and view-invariant features in the …
Beyond bilinear: Generalized multimodal factorized high-order pooling for visual question answering
Visual question answering (VQA) is challenging, because it requires a simultaneous
understanding of both visual content of images and textual content of questions. To support …
understanding of both visual content of images and textual content of questions. To support …
Triplet-based deep hashing network for cross-modal retrieval
Given the benefits of its low storage requirements and high retrieval efficiency, hashing has
recently received increasing attention. In particular, cross-modal hashing has been widely …
recently received increasing attention. In particular, cross-modal hashing has been widely …
Scalable person re-identification on supervised smoothed manifold
Most existing person re-identification algorithms either extract robust visual features or learn
discriminative metrics for person images. However, the underlying manifold which those …
discriminative metrics for person images. However, the underlying manifold which those …
From deterministic to generative: Multimodal stochastic RNNs for video captioning
Video captioning, in essential, is a complex natural process, which is affected by various
uncertainties stemming from video content, subjective judgment, and so on. In this paper, we …
uncertainties stemming from video content, subjective judgment, and so on. In this paper, we …
Multi-task consistency-preserving adversarial hashing for cross-modal retrieval
Owing to the advantages of low storage cost and high query efficiency, cross-modal hashing
has received increasing attention recently. As failing to bridge the inherent modality gap …
has received increasing attention recently. As failing to bridge the inherent modality gap …
Multi-target tracking using CNN-based features: CNNMTT
In this paper, we focus mainly on designing a Multi-Target Object Tracking algorithm that
would produce high-quality trajectories while maintaining low computational costs. Using …
would produce high-quality trajectories while maintaining low computational costs. Using …
Unsupervised cross-dataset person re-identification by transfer learning of spatial-temporal patterns
Most of the proposed person re-identification algorithms conduct supervised training and
testing on single labeled datasets with small size, so directly deploying these trained models …
testing on single labeled datasets with small size, so directly deploying these trained models …
Neural person search machines
We investigate the problem of person search in the wild in this work. Instead of comparing
the query against all candidate regions generated in a query-blind manner, we propose to …
the query against all candidate regions generated in a query-blind manner, we propose to …
Graph structure fusion for multiview clustering
Most existing multiview clustering methods take graphs, which are usually predefined
independently in each view, as input to uncover data distribution. These methods ignore the …
independently in each view, as input to uncover data distribution. These methods ignore the …