A survey of traditional and deep learning-based feature descriptors for high dimensional data in computer vision
Higher dimensional data such as video and 3D are the leading edge of multimedia retrieval
and computer vision research. In this survey, we give a comprehensive overview and key …
and computer vision research. In this survey, we give a comprehensive overview and key …
A neural theory of binocular rivalry.
R Blake - Psychological review, 1989 - psycnet.apa.org
When the two eyes view discrepant monocular stimuli, stable single vision gives way to
alternating periods of monocular dominance; this is the well-known but little understood …
alternating periods of monocular dominance; this is the well-known but little understood …
Deformable convolutional networks
Convolutional neural networks (CNNs) are inherently limited to model geometric
transformations due to the fixed geometric structures in its building modules. In this work, we …
transformations due to the fixed geometric structures in its building modules. In this work, we …
HPatches: A benchmark and evaluation of handcrafted and learned local descriptors
In this paper, we propose a novel benchmark for evaluating local image descriptors. We
demonstrate that the existing datasets and evaluation protocols do not specify …
demonstrate that the existing datasets and evaluation protocols do not specify …
Inside-outside net: Detecting objects in context with skip pooling and recurrent neural networks
It is well known that contextual and multi-scale representations are important for accurate
visual recognition. In this paper we present the Inside-Outside Net (ION), an object detector …
visual recognition. In this paper we present the Inside-Outside Net (ION), an object detector …
Fully convolutional networks for semantic segmentation
Convolutional networks are powerful visual models that yield hierarchies of features. We
show that convolutional networks by themselves, trained end-to-end, pixels-to-pixels …
show that convolutional networks by themselves, trained end-to-end, pixels-to-pixels …
Hypercolumns for object segmentation and fine-grained localization
Recognition algorithms based on convolutional networks (CNNs) typically use the output of
the last layer as feature representation. However, the information in this layer may be too …
the last layer as feature representation. However, the information in this layer may be too …
Cross modal distillation for supervision transfer
In this work we propose a technique that transfers supervision between images from
different modalities. We use learned representations from a large labeled modality as …
different modalities. We use learned representations from a large labeled modality as …
Fast feature pyramids for object detection
Multi-resolution image features may be approximated via extrapolation from nearby scales,
rather than being computed explicitly. This fundamental insight allows us to design object …
rather than being computed explicitly. This fundamental insight allows us to design object …
A performance evaluation of local descriptors
In this paper, we compare the performance of descriptors computed for local interest
regions, as, for example, extracted by the Harris-Affine detector [Mikolajczyk, K and Schmid …
regions, as, for example, extracted by the Harris-Affine detector [Mikolajczyk, K and Schmid …