A brief survey on semantic segmentation with deep learning

S Hao, Y Zhou, Y Guo - Neurocomputing, 2020 - Elsevier
Semantic segmentation is a challenging task in computer vision. In recent years, the
performance of semantic segmentation has been greatly improved by using deep learning …

Recent advances in deep learning for object detection

X Wu, D Sahoo, SCH Hoi - Neurocomputing, 2020 - Elsevier
Object detection is a fundamental visual recognition problem in computer vision and has
been widely studied in the past decades. Visual object detection aims to find objects of …

Mpdiou: a loss for efficient and accurate bounding box regression

S Ma, Y Xu - arxiv preprint arxiv:2307.07662, 2023 - arxiv.org
Bounding box regression (BBR) has been widely used in object detection and instance
segmentation, which is an important step in object localization. However, most of the existing …

Enhancing geometric factors in model learning and inference for object detection and instance segmentation

Z Zheng, P Wang, D Ren, W Liu, R Ye… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Deep learning-based object detection and instance segmentation have achieved
unprecedented progress. In this article, we propose complete-IoU (CIoU) loss and Cluster …

M3d-rpn: Monocular 3d region proposal network for object detection

G Brazil, X Liu - Proceedings of the IEEE/CVF international …, 2019 - openaccess.thecvf.com
Understanding the world in 3D is a critical component of urban autonomous driving.
Generally, the combination of expensive LiDAR sensors and stereo RGB imaging has been …

Occlusion-aware R-CNN: Detecting pedestrians in a crowd

S Zhang, L Wen, X Bian, Z Lei… - Proceedings of the …, 2018 - openaccess.thecvf.com
Pedestrian detection in crowded scenes is a challenging problem since the pedestrians
often gather together and occlude each other. In this paper, we propose a new occlusion …

Improving multispectral pedestrian detection by addressing modality imbalance problems

K Zhou, L Chen, X Cao - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer
Multispectral pedestrian detection is capable of adapting to insufficient illumination
conditions by leveraging color-thermal modalities. On the other hand, it is still lacking of in …

Learning center probability map for detecting objects in aerial images

J Wang, W Yang, HC Li, H Zhang… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
One fundamental problem in Earth Vision is to accurately find the locations and identify the
categories of the interesting objects in the aerial images, for which oriented bounding boxes …

Illumination-aware faster R-CNN for robust multispectral pedestrian detection

C Li, D Song, R Tong, M Tang - Pattern Recognition, 2019 - Elsevier
Multispectral images of color-thermal pairs have shown more effective than a single color
channel for pedestrian detection, especially under challenging illumination conditions …

Gait recognition via disentangled representation learning

Z Zhang, L Tran, X Yin, Y Atoum, X Liu… - Proceedings of the …, 2019 - openaccess.thecvf.com
Gait, the walking pattern of individuals, is one of the most important biometrics modalities.
Most of the existing gait recognition methods take silhouettes or articulated body models as …