Flowformer: A transformer architecture for optical flow Z Huang*, X Shi*, C Zhang, Q Wang, KC Cheung, H Qin, J Dai, H Li Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022 | 326 | 2022 |
Fuseformer: Fusing fine-grained information in transformers for video inpainting R Liu, H Deng, Y Huang, X Shi, L Lu, W Sun, X Wang, J Dai, H Li Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 166 | 2021 |
Flowformer++: Masked cost volume autoencoding for pretraining optical flow estimation X Shi, Z Huang, D Li, M Zhang, KC Cheung, S See, H Qin, J Dai, H Li Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 100 | 2023 |
Videoflow: Exploiting temporal cues for multi-frame optical flow estimation X Shi, Z Huang, W Bian, D Li, M Zhang, KC Cheung, S See, H Qin, J Dai, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 77 | 2023 |
Decoupled spatial-temporal transformer for video inpainting R Liu, H Deng, Y Huang, X Shi, L Lu, W Sun, X Wang, J Dai, H Li arXiv preprint arXiv:2104.06637, 2021 | 70 | 2021 |
Motion-i2v: Consistent and controllable image-to-video generation with explicit motion modeling X Shi, Z Huang, FY Wang, W Bian, D Li, Y Zhang, M Zhang, KC Cheung, ... ACM SIGGRAPH 2024 Conference Papers, 1-11, 2024 | 58 | 2024 |
Kbnet: Kernel basis network for image restoration Y Zhang, D Li, X Shi, D He, K Song, X Wang, H Qin, H Li arXiv preprint arXiv:2303.02881, 2023 | 58 | 2023 |
A simple baseline for video restoration with grouped spatial-temporal shift D Li, X Shi, Y Zhang, KC Cheung, S See, X Wang, H Qin, H Li Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 55 | 2023 |
Blinkflow: A dataset to push the limits of event-based optical flow estimation Y Li, Z Huang, S Chen, X Shi, H Li, H Bao, Z Cui, G Zhang 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2023 | 32 | 2023 |
Animatelcm: Accelerating the animation of personalized diffusion models and adapters with decoupled consistency learning FY Wang, Z Huang, X Shi, W Bian, G Song, Y Liu, H Li arXiv preprint arXiv:2402.00769, 2024 | 26 | 2024 |
A unified conditional framework for diffusion-based image restoration Y Zhang, X Shi, D Li, X Wang, J Wang, H Li Advances in Neural Information Processing Systems 36, 2024 | 15 | 2024 |
Context-PIPs: Persistent Independent Particles Demands Spatial Context Features W Bian, Z Huang, X Shi, Y Dong, Y Li, H Li Advances in Neural Information Processing Systems 36, 55285-55298, 2023 | 10 | 2023 |
Context-tap: Tracking any point demands spatial context features W Bian, Z Huang, X Shi, Y Dong, Y Li, H Li arXiv preprint arXiv:2306.02000 3, 2023 | 10 | 2023 |
No attention is needed: Grouped spatial-temporal shift for simple and efficient video restorers D Li, X Shi, Y Zhang, X Wang, H Qin, H Li arXiv preprint arXiv:2206.10810, 2022 | 6 | 2022 |
Be-your-outpainter: Mastering video outpainting through input-specific adaptation FY Wang, X Wu, Z Huang, X Shi, D Shen, G Song, Y Liu, H Li European Conference on Computer Vision, 153-168, 2024 | 5 | 2024 |
Flowformer: A transformer architecture and its masked cost volume autoencoding for optical flow Z Huang, X Shi, C Zhang, Q Wang, Y Li, H Qin, J Dai, X Wang, H Li arXiv preprint arXiv:2306.05442, 2023 | 5 | 2023 |
Blinkvision: A benchmark for optical flow, scene flow and point tracking estimation using rgb frames and events Y Li, Y Shen, Z Huang, S Chen, W Bian, X Shi, FY Wang, K Sun, H Bao, ... European Conference on Computer Vision, 19-36, 2024 | 2 | 2024 |
Animatelcm: Computation-efficient personalized style video generation without personalized video data FY Wang, Z Huang, W Bian, X Shi, K Sun, G Song, Y Liu, H Li SIGGRAPH Asia 2024 Technical Communications, 1-5, 2024 | 1 | 2024 |
GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking W Bian, Z Huang, X Shi, Y Li, FY Wang, H Li arXiv preprint arXiv:2501.02690, 2025 | | 2025 |
Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediction Tasks M Zhang, G Song, X Shi, Y Liu, H Li European Conference on Computer Vision, 128-145, 2024 | | 2024 |