Botian Shi

Cited by

	All	Since 2020
Citations	2576	2567
h-index	25	25
i10-index	32	32

Citations per year: 2020: 38; 2021: 88; 2022: 196; 2023: 444; 2024: 1490; 2025: 309

Public access

12 articles available

4 articles not available

Based on funding mandates

Co-authors

Nan Duan: Tech Fellow, StepFun | Senior Principal Researcher, Microsoft Research (2012-2024). Verified email at microsoft.com
Huaishao Luo: JD AI Research. Verified email at jd.com
Ming Zhou (周明): Chief Scientist at Sinovation, ACL president (2019), VP of CCF (2020-2024). Verified email at chuangxin.com
Pan Lu: Stanford University. Verified email at stanford.edu
Yaobo Liang: microsoft.com. Verified email at microsoft.com
Zhongyuan Wang: BAAI. Verified email at baai.ac.cn
Yujing Wang: Peking University, Microsoft Research. Verified email at microsoft.com
Graham Neubig: Carnegie Mellon University, All Hands AI. Verified email at cs.cmu.edu
Junyi Du: University of Southern California. Verified email at usc.edu
Fangzheng (Frank) Xu: Microsoft AI. Verified email at microsoft.com
Rong-Cheng Tu: Nanyang Technological University. Verified email at ntu.edu.sg

Botian Shi

Shanghai Artificial Intelligence Laboratory

Verified email at pjlab.org.cn

VLMs, Document Understanding, Autonomous Driving


Title	Authors	Venue	Cited by	Year
Univl: A unified video and language pre-training model for multimodal understanding and generation	H Luo, L Ji, B Shi, H Huang, N Duan, T Li, J Li, T Bharti, M Zhou	arXiv preprint arXiv:2002.06353, 2020	502	2020
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites	Z Chen, W Wang, H Tian, S Ye, Z Gao, E Cui, W Tong, K Hu, J Luo, Z Ma, ...	arXiv preprint arXiv:2404.16821, 2024	386	2024
Drive like a human: Rethinking autonomous driving with large language models	D Fu, X Li, L Wen, M Dou, P Cai, B Shi, Y Qiao	2024 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops …, 2024	164	2024
DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models	L Wen, D Fu, X Li, X Cai, T Ma, P Cai, M Dou, B Shi, L He, Y Qiao	The Twelfth International Conference on Learning Representations (ICLR), 2024	155	2024
Multi-modal sensor fusion for auto driving perception: A survey	K Huang, B Shi, X Li, X Li, S Huang, Y Li	arXiv preprint arXiv:2202.02703, 2022	148	2022
Logonet: Towards accurate 3d object detection with local-to-global cross-modal fusion	X Li, T Ma, Y Hou, B Shi, Y Yang, Y Liu, X Wu, Q Chen, Y Li, Y Qiao, L He	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	120	2023
Knowledge Aware Semantic Concept Expansion for Image-Text Matching	B Shi, L Ji, P Lu, Z Niu, N Duan	Proceedings of the Twenty-Eighth International Joint Conference on …, 2019	85	2019
Dense procedure captioning in narrated instructional videos	B Shi, L Ji, Y Liang, N Duan, P Chen, Z Niu, M Zhou	Proceedings of the 57th annual meeting of the association for computational …, 2019	82	2019
On the road with gpt-4v(ision): Early explorations of visual-language model on autonomous driving	L Wen, X Yang, D Fu, X Wang, P Cai, X Li, T Ma, Y Li, L Xu, D Shang, ...	arXiv preprint arXiv:2311.05332, 2023	70	2023
Streetsurf: Extending multi-view implicit surface reconstruction to street views	J Guo, N Deng, X Li, Y Bai, B Shi, C Wang, C Ding, D Wang, Y Li	arXiv preprint arXiv:2306.04988, 2023	70	2023
Microsoft concept graph: Mining semantic concepts for short text understanding	L Ji, Y Wang, B Shi, D Zhang, Z Wang, J Yan	Data Intelligence 1 (3), 238-270, 2019	67	2019
Multi-sensor fusion and cooperative perception for autonomous driving: A review	C Xiang, C Feng, X Xie, B Shi, H Lu, Y Lv, M Yang, Z Niu	IEEE Intelligent Transportation Systems Magazine 15 (5), 36-58, 2023	60	2023
Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection	X Li, B Shi, Y Hou, X Wu, T Ma, Y Li, L He	Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022	57	2022
Uni3d: A unified baseline for multi-dataset 3d object detection	B Zhang, J Yuan, B Shi, T Chen, Y Li, Y Qiao	Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023	49	2023
Chartx & chartvlm: A versatile benchmark and foundation model for complicated chart reasoning	R Xia, B Zhang, H Ye, X Yan, Q Liu, H Zhou, Z Chen, M Dou, B Shi, J Yan, ...	arXiv preprint arXiv:2402.12185, 2024	43	2024
Expanding performance boundaries of open-source multimodal models with model, data, and test-time scaling	Z Chen, W Wang, Y Cao, Y Liu, Z Gao, E Cui, J Zhu, S Ye, H Tian, Z Liu, ...	arXiv preprint arXiv:2412.05271, 2024	38	2024
Is sora a world simulator? a comprehensive survey on general world models and beyond	Z Zhu, X Wang, W Zhao, C Min, N Deng, M Dou, Y Wang, B Shi, K Wang, ...	arXiv preprint arXiv:2405.03520, 2024	36	2024
Bi3d: Bi-domain active learning for cross-domain 3d object detection	J Yuan, B Zhang, X Yan, T Chen, B Shi, Y Li, Y Qiao	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	34	2023
Ad-pt: Autonomous driving pre-training with large-scale point cloud dataset	J Yuan, B Zhang, X Yan, B Shi, T Chen, Y Li, Y Qiao	Advances in Neural Information Processing Systems 36, 47914-47933, 2023	32	2023
Towards knowledge-driven autonomous driving	X Li, Y Bai, P Cai, L Wen, D Fu, B Zhang, X Yang, X Cai, T Ma, J Guo, ...	arXiv preprint arXiv:2312.04316, 2023	32	2023

Articles 1–20
