Adapt: Action-aware driving caption transformer B Jin, X Liu, Y Zheng, P Li, H Zhao, T Zhang, Y Zheng, G Zhou, J Liu 2023 IEEE International Conference on Robotics and Automation (ICRA), 7554-7561, 2023 | 68 | 2023 |
Steps: Joint self-supervised nighttime image enhancement and depth estimation Y Zheng, C Zhong, P Li, H Gao, Y Zheng, B Jin, L Wang, H Zhao, G Zhou, ... 2023 IEEE International Conference on Robotics and Automation (ICRA), 4916-4923, 2023 | 36 | 2023 |
GaussianGrasper: 3D Language Gaussian Splatting for Open-vocabulary Robotic Grasping Y Zheng, X Chen, Y Zheng, S Gu, R Yang, B Jin, P Li, C Zhong, Z Wang, ... arXiv preprint arXiv:2403.09637, 2024 | 20 | 2024 |
Monoocc: Digging into monocular semantic occupancy prediction Y Zheng, X Li, P Li, Y Zheng, B Jin, C Zhong, X Long, H Zhao, Q Zhang arXiv preprint arXiv:2403.08766, 2024 | 18 | 2024 |
Language-guided semantic style transfer of 3d indoor scenes B Jin, B Tian, H Zhao, G Zhou Proceedings of the 1st Workshop on Photorealistic Image and Environment …, 2022 | 12 | 2022 |
PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning Y Zheng, Z Xing, Q Zhang, B Jin, P Li, Y Zheng, Z Xia, K Zhan, X Lang, ... arXiv preprint arXiv:2406.01587, 2024 | 9 | 2024 |
Tod3cap: Towards 3d dense captioning in outdoor scenes B Jin, Y Zheng, P Li, W Li, Y Zheng, S Hu, X Liu, J Zhu, Z Yan, H Sun, ... European Conference on Computer Vision, 367-384, 2024 | 8 | 2024 |
Dome: Taming diffusion model into high-fidelity controllable occupancy world model S Gu, W Yin, B Jin, X Guo, J Wang, H Li, Q Zhang, X Long arXiv preprint arXiv:2410.10429, 2024 | 4 | 2024 |
Hiprompt: Tuning-free higher-resolution generation with hierarchical mllm prompts X Liu, Y He, L Guo, X Li, B Jin, P Li, Y Li, CM Chan, Q Chen, W Xue, ... arXiv preprint arXiv:2409.02919, 2024 | 3 | 2024 |
Hint-ad: Holistically aligned interpretability in end-to-end autonomous driving K Ding, B Chen, Y Su, H Gao, B Jin, C Sima, W Zhang, X Li, P Barsch, ... arXiv preprint arXiv:2409.06702, 2024 | 2 | 2024 |
Preliminary Investigation into Data Scaling Laws for Imitation Learning-Based End-to-End Autonomous Driving Y Zheng, Z Xia, Q Zhang, T Zhang, B Lu, X Huo, C Han, Y Li, M Yu, B Jin, ... arXiv preprint arXiv:2412.02689, 2024 | 1 | 2024 |
OccVAR: Scalable 4D Occupancy Prediction via Next-Scale Prediction B Jin, X Hu, Y Zheng, X Guo, Q Zhang, Y Yao, D Zhang, X Long, W Yin | | |
3D Dense Captioning beyond Nouns: A Middleware for Autonomous Driving B Jin, Y Zheng, P Li, S Hu, Z Yan, X Liu, Y Zheng, J Huang, J Zhu, G Zhou, ... | | |