3D object detection for autonomous driving: A comprehensive survey

J Mao, S Shi, X Wang, H Li - International Journal of Computer Vision, 2023 - Springer
Autonomous driving, in recent years, has been receiving increasing attention for its potential
to relieve drivers' burdens and improve the safety of driving. In modern autonomous driving …

Delving into the devils of bird's-eye-view perception: A review, evaluation and recipe

H Li, C Sima, J Dai, W Wang, L Lu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Learning powerful representations in bird's-eye-view (BEV) for perception tasks is trending
and drawing extensive attention both from industry and academia. Conventional …

Cross-view transformers for real-time map-view semantic segmentation

B Zhou, P Krähenbühl - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
We present cross-view transformers, an efficient attention-based model for map-view
semantic segmentation from multiple cameras. Our architecture implicitly learns a map** …

Fcos3d: Fully convolutional one-stage monocular 3d object detection

T Wang, X Zhu, J Pang, D Lin - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Monocular 3D object detection is an important task for autonomous driving considering its
advantage of low cost. It is much more challenging than conventional 2D cases due to its …

Is pseudo-lidar needed for monocular 3d object detection?

D Park, R Ambrus, V Guizilini, J Li… - Proceedings of the …, 2021 - openaccess.thecvf.com
Recent progress in 3D object detection from single images leverages monocular depth
estimation as a way to produce 3D pointclouds, turning cameras into pseudo-lidar sensors …

Categorical depth distribution network for monocular 3d object detection

C Reading, A Harakeh, J Chae… - Proceedings of the …, 2021 - openaccess.thecvf.com
Monocular 3D object detection is a key problem for autonomous vehicles, as it provides a
solution with simple configuration compared to typical multi-sensor systems. The main …

Gdr-net: Geometry-guided direct regression network for monocular 6d object pose estimation

G Wang, F Manhardt, F Tombari… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Abstract 6D pose estimation from a single RGB image is a fundamental task in computer
vision. The current top-performing deep learning-based methods rely on an indirect strategy …

Language-grounded indoor 3d semantic segmentation in the wild

D Rozenberszki, O Litany, A Dai - European Conference on Computer …, 2022 - Springer
Recent advances in 3D semantic segmentation with deep neural networks have shown
remarkable success, with rapid performance increase on available datasets. However …

PV-RCNN++: Point-voxel feature set abstraction with local vector representation for 3D object detection

S Shi, L Jiang, J Deng, Z Wang, C Guo, J Shi… - International Journal of …, 2023 - Springer
Abstract 3D object detection is receiving increasing attention from both industry and
academia thanks to its wide applications in various fields. In this paper, we propose Point …

Objects are different: Flexible monocular 3d object detection

Y Zhang, J Lu, J Zhou - … of the IEEE/CVF Conference on …, 2021 - openaccess.thecvf.com
The precise localization of 3D objects from a single image without depth information is a
highly challenging problem. Most existing methods adopt the same approach for all objects …