Radar perception in autonomous driving: Exploring different data representations

S Yao, R Guan, Z Peng, C Xu, Y Shi, Y Yue… - arxiv preprint arxiv …, 2023 - arxiv.org
With the rapid advancements of sensor technology and deep learning, autonomous driving
systems are providing safe and efficient access to intelligent vehicles as well as intelligent …

Learning modality-agnostic representation for semantic segmentation from any modalities

X Zheng, Y Lyu, L Wang - European Conference on Computer Vision, 2024 - Springer
Image modality is not perfect as it often fails in certain conditions, eg, night and fast motion.
This significantly limits the robustness and versatility of existing multi-modal (ie, Image+ X) …

Empowering autonomous driving with large language models: A safety perspective

Y Wang, R Jiao, SS Zhan, C Lang, C Huang… - arxiv preprint arxiv …, 2023 - arxiv.org
Autonomous Driving (AD) encounters significant safety hurdles in long-tail unforeseen
driving scenarios, largely stemming from the non-interpretability and poor generalization of …

Centering the value of every modality: Towards efficient and resilient modality-agnostic semantic segmentation

X Zheng, Y Lyu, J Zhou, L Wang - European Conference on Computer …, 2024 - Springer
Fusing an arbitrary number of modalities is vital for achieving robust multi-modal fusion of
semantic segmentation yet remains less explored to date. Recent endeavors regard RGB …

Muvo: A multimodal generative world model for autonomous driving with geometric representations

D Bogdoll, Y Yang, JM Zöllner - arxiv preprint arxiv:2311.11762, 2023 - arxiv.org
Learning unsupervised world models for autonomous driving has the potential to improve
the reasoning capabilities of today's systems dramatically. However, most work neglects the …

[PDF][PDF] Exploring backdoor attacks against large language model-based decision making

R Jiao, S **e, J Yue, T Sato, L Wang… - arxiv preprint arxiv …, 2024 - researchgate.net
Abstract Large Language Models (LLMs) have shown significant promise in decisionmaking
tasks when fine-tuned on specific applications, leveraging their inherent common sense and …

Lexicon3d: Probing visual foundation models for complex 3d scene understanding

Y Man, S Zheng, Z Bao, M Hebert, LY Gui… - arxiv preprint arxiv …, 2024 - arxiv.org
Complex 3D scene understanding has gained increasing attention, with scene encoding
strategies playing a crucial role in this success. However, the optimal scene encoding …

Unleashing hydra: Hybrid fusion, depth consistency and radar for unified 3d perception

P Wolters, J Gilg, T Teepe, F Herzog, A Laouichi… - arxiv preprint arxiv …, 2024 - arxiv.org
Low-cost, vision-centric 3D perception systems for autonomous driving have made
significant progress in recent years, narrowing the gap to expensive LiDAR-based methods …

Blos-bev: Navigation map enhanced lane segmentation network, beyond line of sight

H Wu, Z Zhang, S Lin, T Qin, J Pan… - 2024 IEEE Intelligent …, 2024 - ieeexplore.ieee.org
Bird's-eye-view (BEV) representation is crucial for the perception function in autonomous
driving tasks. It is difficult to balance the accuracy, efficiency and range of BEV …

CountFormer: Multi-view Crowd Counting Transformer

H Mo, X Zhang, J Tan, C Yang, Q Gu, B Hang… - … on Computer Vision, 2024 - Springer
Multi-view counting (MVC) methods have shown their superiority over single-view
counterparts, particularly in situations characterized by heavy occlusion and severe …