Mapdistill: Boosting efficient camera-based hd map construction via camera-lidar fusion model distillation
Online high-definition (HD) map construction is an important and challenging task in
autonomous driving. Recently, there has been a growing interest in cost-effective multi-view …
autonomous driving. Recently, there has been a growing interest in cost-effective multi-view …
4d contrastive superflows are dense 3d representation learners
In the realm of autonomous driving, accurate 3D perception is the foundation. However,
develo** such models relies on extensive human annotations–a process that is both …
develo** such models relies on extensive human annotations–a process that is both …
[HTML][HTML] See the Unseen: Grid-Wise Drivable Area Detection Dataset and Network Using LiDAR
Drivable Area (DA) detection is crucial for autonomous driving. Camera-based methods rely
heavily on illumination conditions and often fail to capture accurate 3D information, while …
heavily on illumination conditions and often fail to capture accurate 3D information, while …
UniDrive: Towards Universal Driving Perception Across Camera Configurations
Vision-centric autonomous driving has demonstrated excellent performance with
economical sensors. As the fundamental step, 3D perception aims to infer 3D information …
economical sensors. As the fundamental step, 3D perception aims to infer 3D information …
MSC-Bench: Benchmarking and Analyzing Multi-Sensor Corruption for Driving Perception
Multi-sensor fusion models play a crucial role in autonomous driving perception, particularly
in tasks like 3D object detection and HD map construction. These models provide essential …
in tasks like 3D object detection and HD map construction. These models provide essential …
LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes
LiDAR data pretraining offers a promising approach to leveraging large-scale, readily
available datasets for enhanced data utilization. However, existing methods predominantly …
available datasets for enhanced data utilization. However, existing methods predominantly …
GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency
Identifying affordance regions on 3D objects from semantic cues is essential for robotics and
human-machine interaction. However, existing 3D affordance learning methods struggle …
human-machine interaction. However, existing 3D affordance learning methods struggle …
LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving
Recent advancements in vision foundation models (VFMs) have revolutionized visual
perception in 2D, yet their potential for 3D scene understanding, particularly in autonomous …
perception in 2D, yet their potential for 3D scene understanding, particularly in autonomous …
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives
Recent advancements in Vision-Language Models (VLMs) have sparked interest in their use
for autonomous driving, particularly in generating interpretable driving decisions through …
for autonomous driving, particularly in generating interpretable driving decisions through …
KALAHash: Knowledge-Anchored Low-Resource Adaptation for Deep Hashing
Deep hashing has been widely used for large-scale approximate nearest neighbor search
due to its storage and search efficiency. However, existing deep hashing methods …
due to its storage and search efficiency. However, existing deep hashing methods …