A comprehensive survey of transformers for computer vision

S Jamil, M Jalil Piran, OJ Kwon - Drones, 2023 - mdpi.com
As a special type of transformer, vision transformers (ViTs) can be used for various computer
vision (CV) applications. Convolutional neural networks (CNNs) have several potential …

Panoptic segmentation: A review

O Elharrouss, S Al-Maadeed, N Subramanian… - arxiv preprint arxiv …, 2021 - arxiv.org
Image segmentation for video analysis plays an essential role in different research fields
such as smart city, healthcare, computer vision and geoscience, and remote sensing …

Transcrowd: weakly-supervised crowd counting with transformers

D Liang, X Chen, W Xu, Y Zhou, X Bai - Science China Information …, 2022 - Springer
The mainstream crowd counting methods usually utilize the convolution neural network
(CNN) to regress a density map, requiring point-level annotations. However, annotating …

An end-to-end transformer model for crowd localization

D Liang, W Xu, X Bai - European Conference on Computer Vision, 2022 - Springer
Crowd localization, predicting head positions, is a more practical and high-level task than
simply counting. Existing methods employ pseudo-bounding boxes or pre-designed …

Crowdclip: Unsupervised crowd counting via vision-language model

D Liang, J **e, Z Zou, X Ye, W Xu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Supervised crowd counting relies heavily on costly manual labeling, which is difficult and
expensive, especially in dense scenes. To alleviate the problem, we propose a novel …

Efficient-lightweight yolo: Improving small object detection in yolo for aerial images

M Hu, Z Li, J Yu, X Wan, H Tan, Z Lin - Sensors, 2023 - mdpi.com
The most significant technical challenges of current aerial image object-detection tasks are
the extremely low accuracy for detecting small objects that are densely distributed within a …

Spatio-channel attention blocks for cross-modal crowd counting

Y Zhang, S Choi, S Hong - Proceedings of the Asian …, 2022 - openaccess.thecvf.com
Crowd counting research has made significant advancements in real-world applications, but
it remains a formidable challenge in cross modal settings. Most existing methods rely solely …

Dilated-scale-aware category-attention convnet for multi-class object counting

W Xu, D Liang, Y Zheng, J **e… - IEEE Signal Processing …, 2021 - ieeexplore.ieee.org
Object counting aims to estimate the number of objects in images. The leading counting
approaches focus on single-category counting tasks and achieve impressive performance …

[HTML][HTML] Enhanced yolov8-based model with context enrichment module for crowd counting in complex drone imagery

AN Alhawsawi, SD Khan, FU Rehman - Remote Sensing, 2024 - mdpi.com
Crowd counting in aerial images presents unique challenges due to varying altitudes,
angles, and cluttered backgrounds. Additionally, the small size of targets, often occupying …

Density-based clustering with fully-convolutional networks for crowd flow detection from drones

G Castellano, E Cotardo, C Mencar, G Vessio - Neurocomputing, 2023 - Elsevier
Crowd analysis from drones has attracted increasing attention in recent times due to the
ease of use and affordable cost of these devices. However, how this technology can provide …