Multi-modal 3d object detection in autonomous driving: A survey and taxonomy

L Wang, X Zhang, Z Song, J Bi, G Zhang… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
Autonomous vehicles require constant environmental perception to obtain the distribution of
obstacles to achieve safe driving. Specifically, 3D object detection is a vital functional …

Delving into the devils of bird's-eye-view perception: A review, evaluation and recipe

H Li, C Sima, J Dai, W Wang, L Lu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Learning powerful representations in bird's-eye-view (BEV) for perception tasks is trending
and drawing extensive attention both from industry and academia. Conventional …

Milestones in autonomous driving and intelligent vehicles: Survey of surveys

L Chen, Y Li, C Huang, B Li, Y **ng… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
Interest in autonomous driving (AD) and intelligent vehicles (IVs) is growing at a rapid pace
due to the convenience, safety, and economic benefits. Although a number of surveys have …

Dair-v2x: A large-scale dataset for vehicle-infrastructure cooperative 3d object detection

H Yu, Y Luo, M Shu, Y Huo, Z Yang… - Proceedings of the …, 2022 - openaccess.thecvf.com
Autonomous driving faces great safety challenges for a lack of global perspective and the
limitation of long-range perception capabilities. It has been widely agreed that vehicle …

Deep long-tailed learning: A survey

Y Zhang, B Kang, B Hooi, S Yan… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Deep long-tailed learning, one of the most challenging problems in visual recognition, aims
to train well-performing deep models from a large number of images that follow a long-tailed …

Voxel transformer for 3d object detection

J Mao, Y Xue, M Niu, H Bai, J Feng… - Proceedings of the …, 2021 - openaccess.thecvf.com
Abstract We present Voxel Transformer (VoTr), a novel and effective voxel-based
Transformer backbone for 3D object detection from point clouds. Conventional 3D …

Argoverse 2: Next generation datasets for self-driving perception and forecasting

B Wilson, W Qi, T Agarwal, J Lambert, J Singh… - arxiv preprint arxiv …, 2023 - arxiv.org
We introduce Argoverse 2 (AV2)-a collection of three datasets for perception and forecasting
research in the self-driving domain. The annotated Sensor Dataset contains 1,000 …

CLIP2: Contrastive language-image-point pretraining from real-world point cloud data

Y Zeng, C Jiang, J Mao, J Han, C Ye… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Contrastive Language-Image Pre-training, benefiting from large-scale unlabeled
text-image pairs, has demonstrated great performance in open-world vision understanding …

Benchmarking robustness of 3d object detection to common corruptions

Y Dong, C Kang, J Zhang, Z Zhu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract 3D object detection is an important task in autonomous driving to perceive the
surroundings. Despite the excellent performance, the existing 3D detectors lack the …

V2x-seq: A large-scale sequential dataset for vehicle-infrastructure cooperative perception and forecasting

H Yu, W Yang, H Ruan, Z Yang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Utilizing infrastructure and vehicle-side information to track and forecast the behaviors of
surrounding traffic participants can significantly improve decision-making and safety in …