Cascade cost volume for high-resolution multi-view stereo and stereo matching X Gu, Z Fan, S Zhu, Z Dai, F Tan, P Tan Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 843 | 2020 |
Neural window fully-connected CRFs for monocular depth estimation W Yuan, X Gu, Z Dai, S Zhu, P Tan Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 365 | 2022 |
Batch dropblock network for person re-identification and beyond Z Dai, M Chen, X Gu, S Zhu, P Tan Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 308 | 2019 |
Cluster contrast for unsupervised person re-identification Z Dai, G Wang, W Yuan, S Zhu, P Tan Proceedings of the Asian Conference on Computer Vision, 1142-1160, 2022 | 278 | 2022 |
Champ: Controllable and consistent human image animation with 3D parametric guidance S Zhu, JL Chen, Z Dai, Z Dong, Y Xu, X Cao, Y Yao, H Zhu, S Zhu European Conference on Computer Vision, 145-162, 2024 | 67 | 2024 |
Batch feature erasing for person re-identification and beyond Z Dai, M Chen, S Zhu, P Tan arXiv preprint arXiv:1811.07130, 2018 | 62 | 2018 |
Gaussian-Flow: 4D reconstruction with dynamic 3D Gaussian particle Y Lin, Z Dai, S Zhu, Y Yao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 60 | 2024 |
DRO: Deep recurrent optimizer for video to depth X Gu, W Yuan, Z Dai, S Zhu, C Tang, Z Dong, P Tan IEEE Robotics and Automation Letters 8 (5), 2844-2851, 2023 | 46* | 2023 |
RCP: Recurrent closest point for point cloud X Gu, C Tang, W Yuan, Z Dai, S Zhu, P Tan Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 32 | 2022 |
Tora: Trajectory-oriented diffusion transformer for video generation Z Zhang, J Liao, M Li, Z Dai, B Qiu, S Zhu, L Qin, W Wang arXiv preprint arXiv:2407.21705, 2024 | 22 | 2024 |
AnimateAnything: Fine-Grained Open Domain Image Animation with Motion Guidance Z Dai, Z Zhang, Y Yao, B Qiu, S Zhu, L Qin, W Wang arXiv preprint arXiv:2311.12886, 2023 | 18* | 2023 |
MeshMVS: Multi-view stereo guided mesh reconstruction R Shrestha, Z Fan, Q Su, Z Dai, S Zhu, P Tan 2021 International Conference on 3D Vision (3DV), 1290-1300, 2021 | 17 | 2021 |
UVOSAM: A mask-free paradigm for unsupervised video object segmentation via segment anything model Z Zhang, S Zhang, Z Wei, Z Dai, S Zhu arXiv preprint arXiv:2305.12659, 2023 | 13 | 2023 |
Towards Robust Video Instance Segmentation with Temporal-Aware Transformer Z Zhang, F Shao, Z Dai, S Zhu arXiv preprint arXiv:2301.09416, 2023 | 2 | 2023 |
MWVOS: Mask-Free Weakly Supervised Video Object Segmentation via promptable foundation model Z Zhang, S Zhang, Z Dai, Z Dong, S Zhu Pattern Recognition 159, 111100, 2025 | 1 | 2025 |
HunyuanVideo: A systematic framework for large video generative models W Kong, Q Tian, Z Zhang, R Min, Z Dai, J Zhou, J Xiong, X Li, B Wu, ... arXiv preprint arXiv:2412.03603, 2024 | 1 | 2024 |
EffiVED: Efficient Video Editing via Text-instruction Diffusion Models Z Zhang, Z Dai, L Qin, W Wang arXiv preprint arXiv:2403.11568, 2024 | 1 | 2024 |
Fine-grained Text-Video Retrieval with Frozen Image Encoders Z Dai, F Shao, Q Su, Z Dong, S Zhu arXiv preprint arXiv:2307.09972, 2023 | 1 | 2023 |
Text–video retrieval re-ranking via multi-grained cross attention and frozen image encoders Z Dai, K Cheng, F Shao, Z Dong, S Zhu Pattern Recognition 159, 111099, 2025 | | 2025 |