Radar perception in autonomous driving: Exploring different data representations
With the rapid advancements of sensor technology and deep learning, autonomous driving
systems are providing safe and efficient access to intelligent vehicles as well as intelligent …
systems are providing safe and efficient access to intelligent vehicles as well as intelligent …
Learning modality-agnostic representation for semantic segmentation from any modalities
Image modality is not perfect as it often fails in certain conditions, eg, night and fast motion.
This significantly limits the robustness and versatility of existing multi-modal (ie, Image+ X) …
This significantly limits the robustness and versatility of existing multi-modal (ie, Image+ X) …
Empowering autonomous driving with large language models: A safety perspective
Autonomous Driving (AD) encounters significant safety hurdles in long-tail unforeseen
driving scenarios, largely stemming from the non-interpretability and poor generalization of …
driving scenarios, largely stemming from the non-interpretability and poor generalization of …
Centering the value of every modality: Towards efficient and resilient modality-agnostic semantic segmentation
Fusing an arbitrary number of modalities is vital for achieving robust multi-modal fusion of
semantic segmentation yet remains less explored to date. Recent endeavors regard RGB …
semantic segmentation yet remains less explored to date. Recent endeavors regard RGB …
Muvo: A multimodal generative world model for autonomous driving with geometric representations
Learning unsupervised world models for autonomous driving has the potential to improve
the reasoning capabilities of today's systems dramatically. However, most work neglects the …
the reasoning capabilities of today's systems dramatically. However, most work neglects the …
[PDF][PDF] Exploring backdoor attacks against large language model-based decision making
Abstract Large Language Models (LLMs) have shown significant promise in decisionmaking
tasks when fine-tuned on specific applications, leveraging their inherent common sense and …
tasks when fine-tuned on specific applications, leveraging their inherent common sense and …
Lexicon3d: Probing visual foundation models for complex 3d scene understanding
Complex 3D scene understanding has gained increasing attention, with scene encoding
strategies playing a crucial role in this success. However, the optimal scene encoding …
strategies playing a crucial role in this success. However, the optimal scene encoding …
Unleashing hydra: Hybrid fusion, depth consistency and radar for unified 3d perception
Low-cost, vision-centric 3D perception systems for autonomous driving have made
significant progress in recent years, narrowing the gap to expensive LiDAR-based methods …
significant progress in recent years, narrowing the gap to expensive LiDAR-based methods …
Blos-bev: Navigation map enhanced lane segmentation network, beyond line of sight
Bird's-eye-view (BEV) representation is crucial for the perception function in autonomous
driving tasks. It is difficult to balance the accuracy, efficiency and range of BEV …
driving tasks. It is difficult to balance the accuracy, efficiency and range of BEV …
CountFormer: Multi-view Crowd Counting Transformer
Multi-view counting (MVC) methods have shown their superiority over single-view
counterparts, particularly in situations characterized by heavy occlusion and severe …
counterparts, particularly in situations characterized by heavy occlusion and severe …