A survey of traditional and deep learning-based feature descriptors for high dimensional data in computer vision

T Georgiou, Y Liu, W Chen, M Lew - International Journal of Multimedia …, 2020 - Springer
Higher dimensional data such as video and 3D are the leading edge of multimedia retrieval
and computer vision research. In this survey, we give a comprehensive overview and key …

A neural theory of binocular rivalry.

R Blake - Psychological review, 1989 - psycnet.apa.org
When the two eyes view discrepant monocular stimuli, stable single vision gives way to
alternating periods of monocular dominance; this is the well-known but little understood …

Deformable convolutional networks

J Dai, H Qi, Y **ong, Y Li, G Zhang… - Proceedings of the …, 2017 - openaccess.thecvf.com
Convolutional neural networks (CNNs) are inherently limited to model geometric
transformations due to the fixed geometric structures in its building modules. In this work, we …

HPatches: A benchmark and evaluation of handcrafted and learned local descriptors

V Balntas, K Lenc, A Vedaldi… - Proceedings of the …, 2017 - openaccess.thecvf.com
In this paper, we propose a novel benchmark for evaluating local image descriptors. We
demonstrate that the existing datasets and evaluation protocols do not specify …

Inside-outside net: Detecting objects in context with skip pooling and recurrent neural networks

S Bell, CL Zitnick, K Bala… - Proceedings of the IEEE …, 2016 - openaccess.thecvf.com
It is well known that contextual and multi-scale representations are important for accurate
visual recognition. In this paper we present the Inside-Outside Net (ION), an object detector …

Fully convolutional networks for semantic segmentation

J Long, E Shelhamer, T Darrell - Proceedings of the IEEE …, 2015 - openaccess.thecvf.com
Convolutional networks are powerful visual models that yield hierarchies of features. We
show that convolutional networks by themselves, trained end-to-end, pixels-to-pixels …

Hypercolumns for object segmentation and fine-grained localization

B Hariharan, P Arbeláez… - Proceedings of the …, 2015 - openaccess.thecvf.com
Recognition algorithms based on convolutional networks (CNNs) typically use the output of
the last layer as feature representation. However, the information in this layer may be too …

Cross modal distillation for supervision transfer

S Gupta, J Hoffman, J Malik - Proceedings of the IEEE …, 2016 - openaccess.thecvf.com
In this work we propose a technique that transfers supervision between images from
different modalities. We use learned representations from a large labeled modality as …

Fast feature pyramids for object detection

P Dollár, R Appel, S Belongie… - IEEE transactions on …, 2014 - ieeexplore.ieee.org
Multi-resolution image features may be approximated via extrapolation from nearby scales,
rather than being computed explicitly. This fundamental insight allows us to design object …

A performance evaluation of local descriptors

K Mikolajczyk, C Schmid - IEEE transactions on pattern …, 2005 - ieeexplore.ieee.org
In this paper, we compare the performance of descriptors computed for local interest
regions, as, for example, extracted by the Harris-Affine detector [Mikolajczyk, K and Schmid …