Object class detection: A survey

X Zhang, YH Yang, Z Han, H Wang, C Gao - ACM Computing Surveys …, 2013 - dl.acm.org
Object class detection, also known as category-level object detection, has become one of
the most focused areas in computer vision in the new century. This article attempts to …

Particular object retrieval with integral max-pooling of CNN activations

G Tolias, R Sicre, H Jégou - arXiv preprint arXiv:1511.05879, 2015 - arxiv.org
Recently, image representation built upon Convolutional Neural Network (CNN) has been
shown to provide effective descriptors for image search, outperforming pre-CNN features as …

Learning to count with CNN boosting

E Walach, L Wolf - Computer Vision–ECCV 2016: 14th European …, 2016 - Springer
In this paper, we address the task of object counting in images. We follow modern learning
approaches in which a density map is estimated directly from the input image. We employ …

Learning to count objects in images

V Lempitsky, A Zisserman - Advances in neural information …, 2010 - proceedings.neurips.cc
We propose a new supervised learning framework for visual object counting tasks, such as
estimating the number of cells in a microscopic image or the number of humans in …

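A survey of pedestrian detection techniques

苏松志, **绍滋, 陈淑媛, 蔡国榕, 吴云东 - 电子学报, 2012 - ejournal.org.cn
Pedestrian detection is a research hotspot and a hard problem in computer vision. This paper surveys the state of research from 2005 to 2011
on the two core problems in pedestrian detection: feature extraction, and classifiers and localization. The article first takes these problems' …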

Cross-dataset action detection

L Cao, Z Liu, TS Huang - 2010 IEEE Computer Society …, 2010 - ieeexplore.ieee.org
In recent years, many research works have been carried out to recognize human actions
from video clips. To learn an effective action classifier, most of the previous approaches rely …

Multi-view object detection in dual-energy X-ray images

M Baştan - Machine Vision and Applications, 2015 - Springer
Automatic inspection of X-ray scans at security checkpoints can improve the public security.
X-ray images are different from photographic images. They are transparent. They contain …

Spatio-temporal action detection with cascade proposal and location anticipation

Z Yang, J Gao, R Nevatia - arXiv preprint arXiv:1708.00042, 2017 - arxiv.org
In this work, we address the problem of spatio-temporal action detection in temporally
untrimmed videos. It is an important and challenging task as finding accurate human actions …

Efficient action localization with approximately normalized Fisher vectors

D Oneata, J Verbeek, C Schmid - Proceedings of the IEEE …, 2014 - openaccess.thecvf.com
The Fisher vector (FV) representation is a high-dimensional extension of the popular
bag-of-word representation. Transformation of the FV by power and L2 normalizations has shown to …

Efficient structured parsing of facades using dynamic programming

A Cohen, AG Schwing, M Pollefeys - Proceedings of the IEEE …, 2014 - cv-foundation.org
We propose a sequential optimization technique for segmenting a rectified image of a
facade into semantic categories. Our method retrieves a parsing which respects common …