A comprehensive survey of transformers for computer vision
As a special type of transformer, vision transformers (ViTs) can be used for various computer
vision (CV) applications. Convolutional neural networks (CNNs) have several potential …
vision (CV) applications. Convolutional neural networks (CNNs) have several potential …
Panoptic segmentation: A review
Image segmentation for video analysis plays an essential role in different research fields
such as smart city, healthcare, computer vision and geoscience, and remote sensing …
such as smart city, healthcare, computer vision and geoscience, and remote sensing …
Transcrowd: weakly-supervised crowd counting with transformers
The mainstream crowd counting methods usually utilize the convolution neural network
(CNN) to regress a density map, requiring point-level annotations. However, annotating …
(CNN) to regress a density map, requiring point-level annotations. However, annotating …
An end-to-end transformer model for crowd localization
Crowd localization, predicting head positions, is a more practical and high-level task than
simply counting. Existing methods employ pseudo-bounding boxes or pre-designed …
simply counting. Existing methods employ pseudo-bounding boxes or pre-designed …
Crowdclip: Unsupervised crowd counting via vision-language model
Supervised crowd counting relies heavily on costly manual labeling, which is difficult and
expensive, especially in dense scenes. To alleviate the problem, we propose a novel …
expensive, especially in dense scenes. To alleviate the problem, we propose a novel …
Efficient-lightweight yolo: Improving small object detection in yolo for aerial images
M Hu, Z Li, J Yu, X Wan, H Tan, Z Lin - Sensors, 2023 - mdpi.com
The most significant technical challenges of current aerial image object-detection tasks are
the extremely low accuracy for detecting small objects that are densely distributed within a …
the extremely low accuracy for detecting small objects that are densely distributed within a …
Spatio-channel attention blocks for cross-modal crowd counting
Crowd counting research has made significant advancements in real-world applications, but
it remains a formidable challenge in cross modal settings. Most existing methods rely solely …
it remains a formidable challenge in cross modal settings. Most existing methods rely solely …
Dilated-scale-aware category-attention convnet for multi-class object counting
Object counting aims to estimate the number of objects in images. The leading counting
approaches focus on single-category counting tasks and achieve impressive performance …
approaches focus on single-category counting tasks and achieve impressive performance …
[HTML][HTML] Enhanced yolov8-based model with context enrichment module for crowd counting in complex drone imagery
Crowd counting in aerial images presents unique challenges due to varying altitudes,
angles, and cluttered backgrounds. Additionally, the small size of targets, often occupying …
angles, and cluttered backgrounds. Additionally, the small size of targets, often occupying …
Density-based clustering with fully-convolutional networks for crowd flow detection from drones
Crowd analysis from drones has attracted increasing attention in recent times due to the
ease of use and affordable cost of these devices. However, how this technology can provide …
ease of use and affordable cost of these devices. However, how this technology can provide …