CRIS: CLIP-driven referring image segmentation Z Wang, Y Lu, Q Li, X Tao, Y Guo, M Gong, T Liu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 397 | 2022 |
GINet: Graph interaction network for scene parsing T Wu, Y Lu, Y Zhu, C Zhang, M Wu, Z Ma, G Guo Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 44 | 2020 |
FreeLong: Training-free long video generation with SpectralBlend temporal attention Y Lu, Y Liang, L Zhu, Y Yang NeurIPS 2024, 2024 | 13 | 2024 |
Show me a video: A large-scale narrated video dataset for coherent story illustration Y Lu, F Ni, H Wang, X Guo, L Zhu, Z Yang, R Song, L Cheng, Y Yang IEEE Transactions on Multimedia, 2023 | 13 | 2023 |
Zero-shot video grounding with pseudo query lookup and verification Y Lu, R Quan, L Zhu, Y Yang IEEE Transactions on Image Processing 33, 1643-1654, 2024 | 12 | 2024 |
FlowZero: Zero-shot text-to-video synthesis with LLM-driven dynamic scene syntax Y Lu, L Zhu, H Fan, Y Yang arXiv preprint arXiv:2311.15813, 2023 | 11 | 2023 |
Automated multi-level preference for MLLMs M Zhang, W Wu, Y Lu, Y Song, K Rong, H Yao, J Zhao, F Liu, Y Sun, ... arXiv preprint arXiv:2405.11165, 2024 | 5 | 2024 |
C-DLinkNet: considering multi-level semantic features for human parsing Y Lu, M Feng, M Wu, C Zhang arXiv preprint arXiv:2001.11690, 2020 | 3 | 2020 |
ECLIP: Efficient Contrastive Language-Image Pretraining via Ensemble Confidence Learning and Masked Language Modeling J Wang, H Wang, W Wu, J Deng, Y Lu, X Guo, D Zhang First Workshop on Pre-training: Perspectives, Pitfalls, and Paths Forward at …, 2022 | 1 | 2022 |
Exploiting Unlabeled Videos for Video-Text Retrieval via Pseudo-Supervised Learning Y Lu, R Quan, L Zhu, Y Yang IEEE Transactions on Image Processing, 2024 | | 2024 |