Understanding World or Predicting Future? A Comprehensive Survey of World Models

J Ding, Y Zhang, Y Shang, Y Zhang, Z Zong… - arxiv preprint arxiv …, 2024 - arxiv.org
The concept of world models has garnered significant attention due to advancements in
multimodal large language models such as GPT-4 and video generation models such as …

Covla: Comprehensive vision-language-action dataset for autonomous driving

H Arai, K Miwa, K Sasaki, Y Yamaguchi… - arxiv preprint arxiv …, 2024 - arxiv.org
Autonomous driving, particularly navigating complex and unanticipated scenarios, demands
sophisticated reasoning and planning capabilities. While Multi-modal Large Language …

OmniHD-Scenes: A next-generation multimodal dataset for autonomous driving

L Zheng, L Yang, Q Lin, W Ai, M Liu, S Lu, J Liu… - arxiv preprint arxiv …, 2024 - arxiv.org
The rapid advancement of deep learning has intensified the need for comprehensive data
for use by autonomous driving algorithms. High-quality datasets are crucial for the …

[PDF][PDF] Occfiner: Offboard occupancy refinement with hybrid propagation

H Shi, S Wang, J Zhang, X Yin, Z Wang… - arxiv preprint arxiv …, 2024 - researchgate.net
Vision-based occupancy prediction, also known as 3D Semantic Scene Completion (SSC),
presents a significant challenge in computer vision. Previous methods, confined to onboard …

Offboard Occupancy Refinement with Hybrid Propagation for Autonomous Driving

H Shi, S Wang, J Zhang, X Yin, Z Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
Vision-based occupancy prediction, also known as 3D Semantic Scene Completion (SSC),
presents a significant challenge in computer vision. Previous methods, confined to onboard …

LimSim Series: An Autonomous Driving Simulation Platform for Validation and Enhancement

D Fu, N Zhong, X Han, P Cai, L Wen, S Mao… - arxiv preprint arxiv …, 2025 - arxiv.org
Closed-loop simulation environments play a crucial role in the validation and enhancement
of autonomous driving systems (ADS). However, certain challenges warrant significant …

BEV-TSR: Text-Scene Retrieval in BEV Space for Autonomous Driving

T Tang, D Wei, Z Jia, T Gao, C Cai, C Hou, P Jia… - arxiv preprint arxiv …, 2024 - arxiv.org
The rapid development of the autonomous driving industry has led to a significant
accumulation of autonomous driving data. Consequently, there comes a growing demand …

[KNIHA][B] Knowledge-centric Machine Learning on Graphs

Y Tian - 2024 - search.proquest.com
Abstract Graph Machine Learning (GML) has gained considerable attention in modeling
complex graph-structured data, but many of them focus on collecting high-quality data (ie …

Application of foundation models for autonomous driving: a survey of data synthesis

S Gao, B Gao, P Wei, J Guo, M Yuan… - … Conference on Traffic …, 2024 - spiedigitallibrary.org
With the evolution of data-driven autonomous driving technology, transferring driving
responsibility from humans to machines is now feasible. Addressing the long-tail distribution …

[PDF][PDF] Optimizing Task Planning Efficiency in LLMs: Beyond Closed-Loop Systems

L Liu, A Nair, T Peng, S Desai, M Gupta… - Authorea …, 2024 - researchgate.net
Large language models (LLMs) have shown great promise in task execution, but traditional
closed-loop systems limit their planning efficiency. Addressing this challenge, we introduce …