Scale-MAE: A scale-aware masked autoencoder for multiscale geospatial representation learning

CJ Reed, R Gupta, S Li, S Brockman… - Proceedings of the …, 2023 - openaccess.thecvf.com
Large, pretrained models are commonly finetuned with imagery that is heavily augmented to
mimic different conditions and scales, with the resulting models used for various tasks with …

Deep fisher networks for large-scale image classification

K Simonyan, A Vedaldi… - Advances in neural …, 2013 - proceedings.neurips.cc
As massively parallel computations have become broadly available with modern GPUs,
deep architectures trained on very large datasets have risen in popularity. Discriminatively …

Cross-Scale MAE: A tale of multiscale exploitation in remote sensing

M Tang, A Cozma, K Georgiou… - Advances in Neural …, 2024 - proceedings.neurips.cc
Remote sensing images present unique challenges to image analysis due to the extensive
geographic coverage, hardware limitations, and misaligned multi-scale images. This paper …

Learning discriminative part detectors for image classification and cosegmentation

J Sun, J Ponce - … of the IEEE international conference on …, 2013 - openaccess.thecvf.com
In this paper, we address the problem of learning discriminative part detectors from image
sets with category labels. We propose a novel latent SVM model regularized by group …

Soft margin multiple kernel learning

X Xu, IW Tsang, D Xu - IEEE transactions on neural networks …, 2013 - ieeexplore.ieee.org
Multiple kernel learning (MKL) has been proposed for kernel methods by learning the
optimal kernel from a set of predefined base kernels. However, the traditional L1 MKL …

Large-scale video retrieval using image queries

A Araujo, B Girod - IEEE transactions on circuits and systems …, 2017 - ieeexplore.ieee.org
Retrieving videos from large repositories using image queries is important for many
applications, such as brand monitoring or content linking. We introduce a new retrieval …

A fine-grained image categorization system by cellet-encoded spatial pyramid modeling

L Zhang, Y Gao, Y Xia, Q Dai… - IEEE transactions on …, 2014 - ieeexplore.ieee.org
In this paper, a new fine-grained image categorization system that improves spatial pyramid
matching is developed. In this method, multiple cells are combined into cellets in the …

Expanded parts model for human attribute and action recognition in still images

G Sharma, F Jurie, C Schmid - proceedings of the IEEE …, 2013 - openaccess.thecvf.com
We propose a new model for recognizing human attributes (e.g. wearing a suit, sitting, short
hair) and actions (e.g. running, riding a horse) in still images. The proposed model relies on a …

Encoding high dimensional local features by sparse coding based fisher vectors

L Liu, C Shen, L Wang… - Advances in neural …, 2014 - proceedings.neurips.cc
Deriving from the gradient vector of a generative model of local features, Fisher vector
coding (FVC) has been identified as an effective coding method for image classification …

Regularized max pooling for image categorization

M Hoai - Proceedings of the British Machine Vision Conference, 2014 - robots.ox.ac.uk
We propose Regularized Max Pooling (RMP) for image classification. RMP
classifies an image (or an image region) by extracting feature vectors at multiple …