Multi-modal 3d object detection in autonomous driving: A survey and taxonomy
Autonomous vehicles require constant environmental perception to obtain the distribution of
obstacles to achieve safe driving. Specifically, 3D object detection is a vital functional …
obstacles to achieve safe driving. Specifically, 3D object detection is a vital functional …
Delving into the devils of bird's-eye-view perception: A review, evaluation and recipe
Learning powerful representations in bird's-eye-view (BEV) for perception tasks is trending
and drawing extensive attention both from industry and academia. Conventional …
and drawing extensive attention both from industry and academia. Conventional …
Milestones in autonomous driving and intelligent vehicles: Survey of surveys
Interest in autonomous driving (AD) and intelligent vehicles (IVs) is growing at a rapid pace
due to the convenience, safety, and economic benefits. Although a number of surveys have …
due to the convenience, safety, and economic benefits. Although a number of surveys have …
Dair-v2x: A large-scale dataset for vehicle-infrastructure cooperative 3d object detection
Autonomous driving faces great safety challenges for a lack of global perspective and the
limitation of long-range perception capabilities. It has been widely agreed that vehicle …
limitation of long-range perception capabilities. It has been widely agreed that vehicle …
Deep long-tailed learning: A survey
Deep long-tailed learning, one of the most challenging problems in visual recognition, aims
to train well-performing deep models from a large number of images that follow a long-tailed …
to train well-performing deep models from a large number of images that follow a long-tailed …
Voxel transformer for 3d object detection
Abstract We present Voxel Transformer (VoTr), a novel and effective voxel-based
Transformer backbone for 3D object detection from point clouds. Conventional 3D …
Transformer backbone for 3D object detection from point clouds. Conventional 3D …
Argoverse 2: Next generation datasets for self-driving perception and forecasting
We introduce Argoverse 2 (AV2)-a collection of three datasets for perception and forecasting
research in the self-driving domain. The annotated Sensor Dataset contains 1,000 …
research in the self-driving domain. The annotated Sensor Dataset contains 1,000 …
CLIP2: Contrastive language-image-point pretraining from real-world point cloud data
Abstract Contrastive Language-Image Pre-training, benefiting from large-scale unlabeled
text-image pairs, has demonstrated great performance in open-world vision understanding …
text-image pairs, has demonstrated great performance in open-world vision understanding …
Benchmarking robustness of 3d object detection to common corruptions
Abstract 3D object detection is an important task in autonomous driving to perceive the
surroundings. Despite the excellent performance, the existing 3D detectors lack the …
surroundings. Despite the excellent performance, the existing 3D detectors lack the …
V2x-seq: A large-scale sequential dataset for vehicle-infrastructure cooperative perception and forecasting
Utilizing infrastructure and vehicle-side information to track and forecast the behaviors of
surrounding traffic participants can significantly improve decision-making and safety in …
surrounding traffic participants can significantly improve decision-making and safety in …