Towards open vocabulary learning: A survey J Wu, X Li, S Xu, H Yuan, H Ding, Y Yang, X Li, J Zhang, Y Tong, X Jiang, ... IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 | 126 | 2024 |
Towards robust referring image segmentation J Wu, X Li, X Li, H Ding, Y Tong, D Tao IEEE Transactions on Image Processing, 2024 | 49 | 2024 |
Betrayed by captions: Joint caption grounding and generation for open vocabulary instance segmentation J Wu, X Li, H Ding, X Li, G Cheng, Y Tong, CC Loy Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 33 | 2023 |
Towards language-driven video inpainting via multimodal large language models J Wu, X Li, C Si, S Zhou, J Yang, J Zhang, Y Li, K Chen, Y Tong, Z Liu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 21 | 2024 |
Motionbooth: Motion-aware customized text-to-video generation J Wu, X Li, Y Zeng, J Zhang, Q Zhou, Y Li, Y Tong, K Chen arXiv preprint arXiv:2406.17758, 2024 | 16 | 2024 |
Auto cherry-picker: Learning from high-quality generative data driven by language Y Chen, X Li, Y Li, Y Zeng, J Wu, X Zhao, K Chen arXiv preprint arXiv:2406.20085, 2024 | 2 | 2024 |
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation J Wu, C Tang, J Wang, Y Zeng, X Li, Y Tong arXiv preprint arXiv:2412.07589, 2024 | 1 | 2024 |
RelationBooth: Towards Relation-Aware Customized Object Generation Q Shi, L Qi, J Wu, J Bai, J Wang, Y Tong, X Li, MH Yang arXiv preprint arXiv:2410.23280, 2024 | | 2024 |