ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation Y Xu, J Zhang, Q Zhang, D Tao NeurIPS 2022, 2022 | 661 | 2022 |
The seventh visual object tracking VOT2019 challenge results M Kristan, J Matas, A Leonardis, M Felsberg, R Pflugfelder, ... Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 591 | 2019 |
ViTAE: Vision transformer advanced by exploring intrinsic inductive bias Y Xu, Q Zhang, J Zhang, D Tao Advances in neural information processing systems 34, 28522-28535, 2021 | 380 | 2021 |
Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model D Wang, Q Zhang, Y Xu, J Zhang, B Du, D Tao, L Zhang TGRS 2022, 2022 | 259 | 2022 |
ViTAEv2: Vision transformer advanced by exploring inductive bias for image recognition and beyond Q Zhang*, Y Xu*, J Zhang, D Tao IJCV 2022, 2022 | 252 | 2022 |
AP-10K: A benchmark for animal pose estimation in the wild H Yu*, Y Xu*, J Zhang, W Zhao, Z Guan, D Tao NeurIPS 2021 Dataset Track, 2021 | 123 | 2021 |
ViTPose++: Vision transformer for generic body pose estimation Y Xu, J Zhang, Q Zhang, D Tao IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 | 79* | 2023 |
VSA: Learning Varied-Size Window Attention in Vision Transformers Q Zhang*, Y Xu*, J Zhang, D Tao ECCV 2022, 2022 | 71 | 2022 |
DUT: Learning video stabilization by simply watching unstable videos Y Xu, J Zhang, SJ Maybank, D Tao IEEE Transactions on Image Processing, 2022 | 53 | 2022 |
Vision transformer with quadrangle attention Q Zhang, J Zhang, Y Xu, D Tao IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 | 49 | 2024 |
APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking Y Yang, J Yang, Y Xu, J Zhang, L Lan, D Tao NeurIPS 2022 Dataset Track, 2022 | 41 | 2022 |
RegionCL: Exploring contrastive region pairs for self-supervised representation learning Y Xu, Q Zhang, J Zhang, D Tao European conference on computer vision, 477-494, 2022 | 37* | 2022 |
CLAMP: Prompt-based contrastive learning for connecting language and animal pose X Zhang, W Wang, Z Chen, Y Xu, J Zhang, D Tao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 26 | 2023 |
1st Workshop on Maritime Computer Vision (MaCVi) 2023: Challenge results B Kiefer, M Kristan, J Perš, L Žust, F Poiesi, F Andrade, A Bernardino, ... Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2023 | 25 | 2023 |
Out-of-boundary view synthesis towards full-frame video stabilization Y Xu, J Zhang, D Tao Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 17 | 2021 |
HandRefiner: Refining malformed hands in generated images by diffusion-based conditional inpainting W Lu, Y Xu, J Zhang, C Wang, D Tao Proceedings of the 32nd ACM International Conference on Multimedia, 7085-7093, 2024 | 15 | 2024 |
Transformer-based context condensation for boosting feature pyramids in object detection Z Chen, J Zhang, Y Xu, D Tao International Journal of Computer Vision 131 (10), 2738-2756, 2023 | 11 | 2023 |
Revolutionizing agrifood systems with artificial intelligence: a survey T Chen, L Lv, D Wang, J Zhang, Y Yang, Z Zhao, C Wang, X Guo, H Chen, ... ACM Computing Surveys, 2023 | 5 | 2023 |
When ControlNet Meets Inexplicit Masks: A Case Study of ControlNet on its Contour-following Ability W Xuan, Y Xu, S Zhao, C Wang, J Liu, B Du, D Tao Proceedings of the 32nd ACM International Conference on Multimedia, 6979-6988, 2024 | 2 | 2024 |
APTv2: Benchmarking Animal Pose Estimation and Tracking with a Large-scale Dataset and Beyond Y Yang, Y Deng, Y Xu, J Zhang arXiv preprint arXiv:2312.15612, 2023 | 2 | 2023 |