Rethinking spatial invariance of convolutional networks for object counting
Previous work generally believes that improving the spatial invariance of convolutional
networks is the key to object counting. However, after verifying several mainstream counting …
networks is the key to object counting. However, after verifying several mainstream counting …
Transcrowd: weakly-supervised crowd counting with transformers
The mainstream crowd counting methods usually utilize the convolution neural network
(CNN) to regress a density map, requiring point-level annotations. However, annotating …
(CNN) to regress a density map, requiring point-level annotations. However, annotating …
An end-to-end transformer model for crowd localization
Crowd localization, predicting head positions, is a more practical and high-level task than
simply counting. Existing methods employ pseudo-bounding boxes or pre-designed …
simply counting. Existing methods employ pseudo-bounding boxes or pre-designed …
Deep learning in crowd counting: A survey
Counting high‐density objects quickly and accurately is a popular area of research. Crowd
counting has significant social and economic value and is a major focus in artificial …
counting has significant social and economic value and is a major focus in artificial …
Steerer: Resolving scale variations for counting and localization via selective inheritance learning
Scale variation is a deep-rooted problem in object counting, which has not been effectively
addressed by existing scale-aware algorithms. An important factor is that they typically …
addressed by existing scale-aware algorithms. An important factor is that they typically …
Optimal transport minimization: Crowd localization on density maps for semi-supervised counting
The accuracy of crowd counting in images has improved greatly in recent years due to the
development of deep neural networks for predicting crowd density maps. However, most …
development of deep neural networks for predicting crowd density maps. However, most …
Crowdclip: Unsupervised crowd counting via vision-language model
Supervised crowd counting relies heavily on costly manual labeling, which is difficult and
expensive, especially in dense scenes. To alleviate the problem, we propose a novel …
expensive, especially in dense scenes. To alleviate the problem, we propose a novel …
Learning to count everything
Existing works on visual counting primarily focus on one specific category at a time, such as
people, animals, and cells. In this paper, we are interested in counting everything, that is to …
people, animals, and cells. In this paper, we are interested in counting everything, that is to …
Represent, compare, and learn: A similarity-aware framework for class-agnostic counting
Class-agnostic counting (CAC) aims to count all instances in a query image given few
exemplars. A standard pipeline is to extract visual features from exemplars and match them …
exemplars. A standard pipeline is to extract visual features from exemplars and match them …
Focal inverse distance transform maps for crowd localization
In this paper, we focus on the crowd localization task, a crucial topic of crowd analysis. Most
regression-based methods utilize convolution neural networks (CNN) to regress a density …
regression-based methods utilize convolution neural networks (CNN) to regress a density …