Xizhou Zhu

Cited by

	All	Since 2020
Citations	18908	18399
h-index	38	38
i10-index	53	53

Citations per year

Year	Citations
2018	147
2019	303
2020	705
2021	1606
2022	2844
2023	4347
2024	7565
2025	1319

Public access

15 articles available
0 articles not available

Based on funding mandates

Xizhou Zhu

Tsinghua University

Verified email at tsinghua.edu.cn


Title	Cited by	Year
Deformable DETR: Deformable transformers for end-to-end object detection X Zhu, W Su, L Lu, B Li, X Wang, J Dai arXiv preprint arXiv:2010.04159, 2020	6315	2020
Deformable ConvNets v2: More deformable, better results X Zhu, H Hu, S Lin, J Dai Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019	2606	2019
VL-BERT: Pre-training of generic visual-linguistic representations W Su, X Zhu, Y Cao, B Li, L Lu, F Wei, J Dai arXiv preprint arXiv:1908.08530, 2019	1947	2019
Deep feature flow for video recognition X Zhu, Y Xiong, J Dai, L Yuan, Y Wei Proceedings of the IEEE conference on computer vision and pattern …, 2017	868	2017
InternImage: Exploring large-scale vision foundation models with deformable convolutions W Wang, J Dai, Z Chen, Z Huang, Z Li, X Zhu, X Hu, T Lu, L Lu, H Li, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023	848	2023
Flow-guided feature aggregation for video object detection X Zhu, Y Wang, J Dai, L Yuan, Y Wei Proceedings of the IEEE international conference on computer vision, 408-417, 2017	842	2017
An empirical study of spatial attention mechanisms in deep networks X Zhu, D Cheng, Z Zhang, S Lin, J Dai Proceedings of the IEEE/CVF international conference on computer vision …, 2019	623	2019
Planning-oriented autonomous driving Y Hu, J Yang, L Chen, K Li, C Sima, X Zhu, S Chai, S Du, T Lin, W Wang, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023	621	2023
VisionLLM: Large language model is also an open-ended decoder for vision-centric tasks W Wang, Z Chen, X Chen, J Wu, X Zhu, G Zeng, P Luo, T Lu, J Zhou, ... Advances in Neural Information Processing Systems 36, 61501-61513, 2023	446	2023
How far are we to GPT-4V? Closing the gap to commercial multimodal models with open-source suites Z Chen, W Wang, H Tian, S Ye, Z Gao, E Cui, W Tong, K Hu, J Luo, Z Ma, ... Science China Information Sciences 67 (12), 220101, 2024	389	2024
Deformable DETR: Deformable transformers for end-to-end object detection. arXiv 2020 X Zhu, W Su, L Lu, B Li, X Wang, J Dai arXiv preprint arXiv:2010.04159 3, 2010	365	2010
Towards high performance video object detection X Zhu, J Dai, L Yuan, Y Wei Proceedings of the IEEE conference on computer vision and pattern …, 2018	327	2018
BEVFormer v2: Adapting modern image backbones to bird's-eye-view recognition via perspective supervision C Yang, Y Chen, H Tian, C Tao, X Zhu, Z Zhang, G Huang, H Li, Y Qiao, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	271	2023
InternVL: Scaling up vision foundation models and aligning for generic visual-linguistic tasks Z Chen, J Wu, W Wang, W Su, G Chen, S Xing, M Zhong, Q Zhang, X Zhu, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2024	185	2024
Delving into the devils of bird’s-eye-view perception: A review, evaluation and recipe H Li, C Sima, J Dai, W Wang, L Lu, H Wang, J Zeng, Z Li, J Yang, H Deng, ... IEEE Transactions on Pattern Analysis and Machine Intelligence 46 (4), 2151-2170, 2023	144	2023
Uni-Perceiver: Pre-training unified architecture for generic perception for zero-shot and few-shot tasks X Zhu, J Zhu, H Li, X Wu, H Li, X Wang, J Dai Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	139	2022
Spatially adaptive inference with stochastic feature sampling and interpolation Z Xie, Z Zhang, X Zhu, G Huang, S Lin Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020	117	2020
Siamese image modeling for self-supervised vision representation learning C Tao, X Zhu, W Su, G Huang, B Li, J Zhou, Y Qiao, X Wang, J Dai Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	111	2023
Ghost in the Minecraft: Generally capable agents for open-world environments via large language models with text-based knowledge and memory X Zhu, Y Chen, H Tian, C Tao, W Su, C Yang, G Huang, B Li, L Lu, ... arXiv preprint arXiv:2305.17144, 2023	100	2023
DriveMLM: Aligning multi-modal large language models with behavioral planning states for autonomous driving W Wang, J Xie, CY Hu, H Zou, J Fan, W Tong, Y Wen, S Wu, H Deng, Z Li, ... arXiv preprint arXiv:2312.09245, 2023	98	2023

Articles 1–20
