Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection

E Zhou, Q Su, C Chi, Z Zhang, Z Wang, T Huang… - arxiv preprint arxiv …, 2024 - arxiv.org
Automatic detection and prevention of open-set failures are crucial in closed-loop robotic
systems. Recent studies often struggle to simultaneously identify unexpected failures …

Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation

Y Wang, X He, K Wang, L Ma, J Yang, S Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
The current state-of-the-art video generative models can produce commercial-grade videos
with highly realistic details. However, they still struggle to coherently present multiple …

Owl-1: Omni World Model for Consistent Long Video Generation

Y Huang, W Zheng, Y Gao, X Tao, P Wan… - arxiv preprint arxiv …, 2024 - arxiv.org
Video generation models (VGMs) have received extensive attention recently and serve as
promising candidates for general-purpose large vision models. While they can only …

GameFactory: Creating New Games with Generative Interactive Videos

J Yu, Y Qin, X Wang, P Wan, D Zhang, X Liu - arxiv preprint arxiv …, 2025 - arxiv.org
Generative game engines have the potential to revolutionize game development by
autonomously creating new content and reducing manual workload. However, existing …

A Survey of World Models for Autonomous Driving

T Feng, W Wang, Y Yang - arxiv preprint arxiv:2501.11260, 2025 - arxiv.org
Recent breakthroughs in autonomous driving have revolutionized the way vehicles perceive
and interact with their surroundings. In particular, world models have emerged as a linchpin …