Grid-centric traffic scenario perception for autonomous driving: A comprehensive review

Y Shi, K Jiang, J Li, Z Qian, J Wen… - … on Neural Networks …, 2024 - ieeexplore.ieee.org
The grid-centric perception is a crucial field for mobile robot perception and navigation.
Nonetheless, the grid-centric perception is less prevalent than object-centric perception as …

Genad: Generative end-to-end autonomous driving

W Zheng, R Song, X Guo, C Zhang, L Chen - European Conference on …, 2024 - Springer
Directly producing planning results from raw sensors has been a long-desired solution for
autonomous driving and has attracted increasing attention recently. Most existing end-to …

Gaussianformer: Scene as gaussians for vision-based 3d semantic occupancy prediction

Y Huang, W Zheng, Y Zhang, J Zhou, J Lu - European Conference on …, 2024 - Springer
Abstract 3D semantic occupancy prediction aims to obtain 3D fine-grained geometry and
semantics of the surrounding scene and is an important task for the robustness of vision …

Is sora a world simulator? a comprehensive survey on general world models and beyond

Z Zhu, X Wang, W Zhao, C Min, N Deng, M Dou… - arxiv preprint arxiv …, 2024 - arxiv.org
General world models represent a crucial pathway toward achieving Artificial General
Intelligence (AGI), serving as the cornerstone for various applications ranging from virtual …

Towards knowledge-driven autonomous driving

X Li, Y Bai, P Cai, L Wen, D Fu, B Zhang… - arxiv preprint arxiv …, 2023 - arxiv.org
This paper explores the emerging knowledge-driven autonomous driving technologies. Our
investigation highlights the limitations of current autonomous driving systems, in particular …

A survey on occupancy perception for autonomous driving: The information fusion perspective

H Xu, J Chen, S Meng, Y Wang, LP Chau - Information Fusion, 2025 - Elsevier
Abstract 3D occupancy perception technology aims to observe and understand dense 3D
environments for autonomous vehicles. Owing to its comprehensive perception capability …

Bevworld: A multimodal world model for autonomous driving via unified bev latent space

Y Zhang, S Gong, K **ong, X Ye, X Tan, F Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
World models are receiving increasing attention in autonomous driving for their ability to
predict potential future scenarios. In this paper, we present BEVWorld, a novel approach that …

Occllama: An occupancy-language-action generative world model for autonomous driving

J Wei, S Yuan, P Li, Q Hu, Z Gan, W Ding - arxiv preprint arxiv …, 2024 - arxiv.org
The rise of multi-modal large language models (MLLMs) has spurred their applications in
autonomous driving. Recent MLLM-based methods perform action by learning a direct …

Forging vision foundation models for autonomous driving: Challenges, methodologies, and opportunities

X Yan, H Zhang, Y Cai, J Guo, W Qiu, B Gao… - arxiv preprint arxiv …, 2024 - arxiv.org
The rise of large foundation models, trained on extensive datasets, is revolutionizing the
field of AI. Models such as SAM, DALL-E2, and GPT-4 showcase their adaptability by …

World models for autonomous driving: An initial survey

Y Guan, H Liao, Z Li, J Hu, R Yuan, Y Li… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
In the rapidly evolving landscape of autonomous driving, the capability to accurately predict
future events and assess their implications is paramount for both safety and efficiency …