SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation Z Luo*, Y Xiao*, Y Liu*, S Li, Y Wang, Y Tang, X Li, Y Yang NeurIPS 2023, 2023 | 41 | 2023 |
Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection Y Xiao, Z Luo, Y Liu, Y Ma, H Bian, Y Ji, Y Yang, X Li CVPR 2024, 2023 | 32 | 2023 |
COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing J Wang, Y Ma, J Guo, Y Xiao, G Huang, X Li NeurIPS 2024, 2024 | 15 | 2024 |
SemanticAC: Semantics-Assisted Framework for Audio Classification Y Xiao, Y Ma, S Li, H Zhou, R Liao, X Li ICASSP 2023, 2023 | 8 | 2023 |
MambaTree: Tree Topology is All You Need in State Space Model Y Xiao, L Song, S Huang, J Wang, S Song, Y Ge, X Li, Y Shan NeurIPS 2024 Spotlight, 0 | 7* | |
1st Place Solution for 5th LSVOS Challenge: Referring Video Object Segmentation Z Luo*, Y Xiao*, Y Liu*, Y Wang, Y Tang, X Li, Y Yang ICCV Workshop 2023, 0 | 3* | |
HDC: Hierarchical Semantic Decoding with Counting Assistance for Generalized Referring Expression Segmentation Z Luo, Y Wu, Y Liu, Y Xiao, XP Zhang, Y Yang arXiv preprint arXiv:2405.15658, 2024 | 2 | 2024 |
Rethinking Video-Text Pretraining Via Masked Autoencoder S Chen, Y Ma, B Jia, X Li, Y Xiao, T Yang Available at SSRN 5071942, 0 | | |