3D object detection for autonomous driving: A comprehensive survey

J Mao, S Shi, X Wang, H Li - International Journal of Computer Vision, 2023 - Springer
Autonomous driving, in recent years, has been receiving increasing attention for its potential
to relieve drivers' burdens and improve the safety of driving. In modern autonomous driving …

Delving into the devils of bird's-eye-view perception: A review, evaluation and recipe

H Li, C Sima, J Dai, W Wang, L Lu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Learning powerful representations in bird's-eye-view (BEV) for perception tasks is trending
and drawing extensive attention both from industry and academia. Conventional …

Bevfusion: A simple and robust lidar-camera fusion framework

T Liang, H **e, K Yu, Z **a, Z Lin… - Advances in …, 2022 - proceedings.neurips.cc
Fusing the camera and LiDAR information has become a de-facto standard for 3D object
detection tasks. Current methods rely on point clouds from the LiDAR sensor as queries to …

Bevdet: High-performance multi-camera 3d object detection in bird-eye-view

J Huang, G Huang, Z Zhu, Y Ye, D Du - arxiv preprint arxiv:2112.11790, 2021 - arxiv.org
Autonomous driving perceives its surroundings for decision making, which is one of the most
complex scenarios in visual perception. The success of paradigm innovation in solving the …

Bevdet4d: Exploit temporal cues in multi-camera 3d object detection

J Huang, G Huang - arxiv preprint arxiv:2203.17054, 2022 - arxiv.org
Single frame data contains finite information which limits the performance of the existing
vision-based multi-camera 3D object detection paradigms. For fundamentally pushing the …

Monodtr: Monocular 3d object detection with depth-aware transformer

KC Huang, TH Wu, HT Su… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Monocular 3D object detection is an important yet challenging task in autonomous driving.
Some existing methods leverage depth information from an off-the-shelf depth estimator to …

Omni3d: A large benchmark and model for 3d object detection in the wild

G Brazil, A Kumar, J Straub, N Ravi… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recognizing scenes and objects in 3D from a single image is a longstanding goal of
computer vision with applications in robotics and AR/VR. For 2D recognition, large datasets …

Autoshape: Real-time shape-aware monocular 3d object detection

Z Liu, D Zhou, F Lu, J Fang… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Existing deep learning-based approaches for monocular 3D object detection in autonomous
driving often model the object as a rotated 3D cuboid while the object's geometric shape has …

Learning auxiliary monocular contexts helps monocular 3d object detection

X Liu, N Xue, T Wu - Proceedings of the AAAI Conference on Artificial …, 2022 - ojs.aaai.org
Monocular 3D object detection aims to localize 3D bounding boxes in an input single 2D
image. It is a highly challenging problem and remains open, especially when no extra …

3d object detection from images for autonomous driving: a survey

X Ma, W Ouyang, A Simonelli… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
3D object detection from images, one of the fundamental and challenging problems in
autonomous driving, has received increasing attention from both industry and academia in …