Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection S Wang, Y Liu, T Wang, Y Li, X Zhang | 195 | 2023 |
Far3D: Expanding the horizon for surround-view 3D object detection X Jiang, S Li, Y Liu, S Wang, F Jia, T Wang, L Han, X Zhang Proceedings of the AAAI Conference on Artificial Intelligence 38 (3), 2561-2569, 2024 | 48 | 2024 |
Eagle: Exploring the design space for multimodal LLMs with mixture of encoders M Shi, F Liu, S Wang, S Liao, S Radhakrishnan, DA Huang, H Yin, ... arXiv preprint arXiv:2408.15998, 2024 | 43 | 2024 |
OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning S Wang, Z Yu, X Jiang, S Lan, M Shi, N Chang, J Kautz, Y Li, JM Alvarez arXiv preprint arXiv:2405.01533, 2024 | 32 | 2024 |
Focal-PETR: Embracing foreground for efficient multi-camera 3D object detection S Wang, X Jiang, Y Li IEEE Transactions on Intelligent Vehicles, 2023 | 29 | 2023 |
Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation Z Li, K Li, S Wang, S Lan, Z Yu, Y Ji, Z Li, Z Zhu, J Kautz, Z Wu, YG Jiang, ... arXiv preprint arXiv:2406.06978, 2024 | 13 | 2024 |
StreamChat: Chatting with Streaming Video J Liu, Z Yu, S Lan, S Wang, R Fang, J Kautz, H Li, JM Alvarez arXiv preprint arXiv:2412.08646, 2024 | | 2024 |