Deep learning on monocular object pose detection and tracking: A comprehensive overview

Z Fan, Y Zhu, Y He, Q Sun, H Liu, J He - ACM Computing Surveys, 2022 - dl.acm.org
Object pose detection and tracking has recently attracted increasing attention due to its wide
applications in many areas, such as autonomous driving, robotics, and augmented reality …

Dair-v2x: A large-scale dataset for vehicle-infrastructure cooperative 3d object detection

H Yu, Y Luo, M Shu, Y Huo, Z Yang… - Proceedings of the …, 2022 - openaccess.thecvf.com
Autonomous driving faces great safety challenges for a lack of global perspective and the
limitation of long-range perception capabilities. It has been widely agreed that vehicle …

Multipath++: Efficient information fusion and trajectory aggregation for behavior prediction

B Varadarajan, A Hefny, A Srivastava… - … on Robotics and …, 2022 - ieeexplore.ieee.org
Predicting the future behavior of road users is one of the most challenging and important
problems in autonomous driving. Applying deep learning to this problem requires fusing …

Simpleclick: Interactive image segmentation with simple vision transformers

Q Liu, Z Xu, G Bertasius… - Proceedings of the …, 2023 - openaccess.thecvf.com
Click-based interactive image segmentation aims at extracting objects with a limited user
clicking. A hierarchical backbone is the de-facto architecture for current methods. Recently …

Epro-pnp: Generalized end-to-end probabilistic perspective-n-points for monocular object pose estimation

H Chen, P Wang, F Wang, W Tian… - Proceedings of the …, 2022 - openaccess.thecvf.com
Locating 3D objects from a single RGB image via Perspective-n-Points (PnP) is a long-
standing problem in computer vision. Driven by end-to-end deep learning, recent studies …

Vision-centric bev perception: A survey

Y Ma, T Wang, X Bai, H Yang, Y Hou… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
In recent years, vision-centric Bird's Eye View (BEV) perception has garnered significant
interest from both industry and academia due to its inherent advantages, such as providing …

Image-to-lidar self-supervised distillation for autonomous driving data

C Sautier, G Puy, S Gidaris, A Boulch… - Proceedings of the …, 2022 - openaccess.thecvf.com
Segmenting or detecting objects in sparse Lidar point clouds are two important tasks in
autonomous driving to allow a vehicle to act safely in its 3D environment. The best …

Voxel field fusion for 3d object detection

Y Li, X Qi, Y Chen, L Wang, Z Li… - Proceedings of the …, 2022 - openaccess.thecvf.com
In this work, we present a conceptually simple yet effective framework for cross-modality 3D
object detection, named voxel field fusion. The proposed approach aims to maintain cross …

Boreas: A multi-season autonomous driving dataset

K Burnett, DJ Yoon, Y Wu, AZ Li… - … Journal of Robotics …, 2023 - journals.sagepub.com
The Boreas dataset was collected by driving a repeated route over the course of 1 year,
resulting in stark seasonal variations and adverse weather conditions such as rain and …

VPFNet: Improving 3D object detection with virtual point based LiDAR and stereo data fusion

H Zhu, J Deng, Y Zhang, J Ji, Q Mao… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
It has been well recognized that fusing the complementary information from depth-aware
LiDAR point clouds and semantic-rich stereo images would benefit 3D object detection …