3D object detection for autonomous driving: A comprehensive survey

J Mao, S Shi, X Wang, H Li - International Journal of Computer Vision, 2023 - Springer
Autonomous driving, in recent years, has been receiving increasing attention for its potential
to relieve drivers' burdens and improve the safety of driving. In modern autonomous driving …

Delving into the devils of bird's-eye-view perception: A review, evaluation and recipe

H Li, C Sima, J Dai, W Wang, L Lu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Learning powerful representations in bird's-eye-view (BEV) for perception tasks is trending
and drawing extensive attention both from industry and academia. Conventional …

Tri-perspective view for vision-based 3d semantic occupancy prediction

Y Huang, W Zheng, Y Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Modern methods for vision-centric autonomous driving perception widely adopt the bird's-
eye-view (BEV) representation to describe a 3D scene. Despite its better efficiency than …

Planning-oriented autonomous driving

Y Hu, J Yang, L Chen, K Li, C Sima… - Proceedings of the …, 2023 - openaccess.thecvf.com
Modern autonomous driving system is characterized as modular tasks in sequential order,
ie, perception, prediction, and planning. In order to perform a wide diversity of tasks and …

Surroundocc: Multi-camera 3d occupancy prediction for autonomous driving

Y Wei, L Zhao, W Zheng, Z Zhu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract 3D scene understanding plays a vital role in vision-based autonomous driving.
While most existing methods focus on 3D object detection, they have difficulty describing …

Vad: Vectorized scene representation for efficient autonomous driving

B Jiang, S Chen, Q Xu, B Liao, J Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Autonomous driving requires a comprehensive understanding of the surrounding
environment for reliable trajectory planning. Previous works rely on dense rasterized scene …

Occformer: Dual-path transformer for vision-based 3d semantic occupancy prediction

Y Zhang, Z Zhu, D Du - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
The vision-based perception for autonomous driving has undergone a transformation from
the bird-eye-view (BEV) representations to the 3D semantic occupancy. Compared with the …

DriveDreamer: Towards Real-World-Drive World Models for Autonomous Driving

X Wang, Z Zhu, G Huang, X Chen, J Zhu… - European Conference on …, 2024 - Springer
World models, especially in autonomous driving, are trending and drawing extensive
attention due to their capacity for comprehending driving environments. The established …

Petrv2: A unified framework for 3d perception from multi-camera images

Y Liu, J Yan, F Jia, S Li, A Gao… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this paper, we propose PETRv2, a unified framework for 3D perception from multi-view
images. Based on PETR, PETRv2 explores the effectiveness of temporal modeling, which …

Selfocc: Self-supervised vision-based 3d occupancy prediction

Y Huang, W Zheng, B Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract 3D occupancy prediction is an important task for the robustness of vision-centric
autonomous driving which aims to predict whether each point is occupied in the surrounding …