3D object detection for autonomous driving: A comprehensive survey
Autonomous driving, in recent years, has been receiving increasing attention for its potential
to relieve drivers' burdens and improve the safety of driving. In modern autonomous driving …
to relieve drivers' burdens and improve the safety of driving. In modern autonomous driving …
Deep learning for 3d point clouds: A survey
Point cloud learning has lately attracted increasing attention due to its wide applications in
many areas, such as computer vision, autonomous driving, and robotics. As a dominating …
many areas, such as computer vision, autonomous driving, and robotics. As a dominating …
Transfusion: Robust lidar-camera fusion for 3d object detection with transformers
LiDAR and camera are two important sensors for 3D object detection in autonomous driving.
Despite the increasing popularity of sensor fusion in this field, the robustness against inferior …
Despite the increasing popularity of sensor fusion in this field, the robustness against inferior …
Surface representation for point clouds
Most prior work represents the shapes of point clouds by coordinates. However, it is
insufficient to describe the local geometry directly. In this paper, we present RepSurf …
insufficient to describe the local geometry directly. In this paper, we present RepSurf …
Multimodal token fusion for vision transformers
Many adaptations of transformers have emerged to address the single-modal vision tasks,
where self-attention modules are stacked to handle input sources like images. Intuitively …
where self-attention modules are stacked to handle input sources like images. Intuitively …
Deepinteraction: 3d object detection via modality interaction
Existing top-performance 3D object detectors typically rely on the multi-modal fusion
strategy. This design is however fundamentally restricted due to overlooking the modality …
strategy. This design is however fundamentally restricted due to overlooking the modality …
Searching efficient 3d architectures with sparse point-voxel convolution
Self-driving cars need to understand 3D scenes efficiently and accurately in order to drive
safely. Given the limited hardware resources, existing 3D perception models are not able to …
safely. Given the limited hardware resources, existing 3D perception models are not able to …
Pointcontrast: Unsupervised pre-training for 3d point cloud understanding
Arguably one of the top success stories of deep learning is transfer learning. The finding that
pre-training a network on a rich source set (eg, ImageNet) can help boost performance once …
pre-training a network on a rich source set (eg, ImageNet) can help boost performance once …
Group-free 3d object detection via transformers
Recently, directly detecting 3D objects from 3D point clouds has received increasing
attention. To extract object representation from an irregular point cloud, existing methods …
attention. To extract object representation from an irregular point cloud, existing methods …
Open-vocabulary queryable scene representations for real world planning
Large language models (LLMs) have unlocked new capabilities of task planning from
human instructions. However, prior attempts to apply LLMs to real-world robotic tasks are …
human instructions. However, prior attempts to apply LLMs to real-world robotic tasks are …