Drivedreamer-2: Llm-enhanced world models for diverse driving video generation G Zhao, X Wang, Z Zhu, X Chen, G Huang, X Bao, X Wang arXiv preprint arXiv:2403.06845, 2024 | 41 | 2024 |
Drivedreamer4d: World models are effective data machines for 4d driving scene representation G Zhao, C Ni, X Wang, Z Zhu, X Zhang, Y Wang, G Huang, X Chen, ... arXiv preprint arXiv:2410.13571, 2024 | 8 | 2024 |
Cores: Orchestrating the dance of reasoning and segmentation X Bao, S Sun, S Ma, K Zheng, Y Guo, G Zhao, Y Zheng, X Wang European Conference on Computer Vision, 187-204, 2024 | 3 | 2024 |
ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration C Ni, G Zhao, X Wang, Z Zhu, W Qin, G Huang, C Liu, Y Chen, Y Wang, ... arXiv preprint arXiv:2411.19548, 2024 | 2 | 2024 |
EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation X Wang, K Zhao, F Liu, J Wang, G Zhao, X Bao, Z Zhu, Y Zhang, X Wang arXiv preprint arXiv:2411.08380, 2024 | 1 | 2024 |