Revisiting temporal modeling for clip-based image-to-video knowledge transferring R Liu, J Huang, G Li, J Feng, X Wu, TH Li Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 58 | 2023 |
St-llm: Large language models are effective temporal learners R Liu, C Li, H Tang, Y Ge, Y Shan, G Li European Conference on Computer Vision, 1-18, 2024 | 55 | 2024 |
BT-Adapter: Video Conversation is Feasible Without Video Instruction Tuning R Liu, C Li, Y Ge, TH Li, Y Shan, G Li Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 40 | 2024 |
Contextual debiasing for visual recognition with causal mechanisms R Liu, H Liu, G Li, H Hou, TH Yu, T Yang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 38 | 2022 |
Memory-based network for scene graph with unbalanced relations W Wang, R Liu, M Wang, S Wang, X Chang, Y Chen Proceedings of the 28th ACM International Conference on Multimedia, 2400-2408, 2020 | 20 | 2020 |
Causality compensated attention for contextual biased visual recognition R Liu, J Huang, TH Li, G Li The eleventh international conference on learning representations, 2022 | 19 | 2022 |
Rap: Efficient text-video retrieval with sparse-and-correlated adapter M Cao, H Tang, J Huang, P Jin, C Zhang, R Liu, L Chen, X Liang, L Yuan, ... arXiv preprint arXiv:2405.19465, 2024 | 13 | 2024 |
Mug-STAN: adapting image-language pretrained models for general video understanding R Liu, J Huang, W Gao, TH Li, G Li arXiv preprint arXiv:2311.15075, 2023 | 12 | 2023 |
An opencv-based framework for table information extraction J Yuan, H Li, M Wang, R Liu, C Li, B Wang 2020 IEEE International Conference on Knowledge Graph (ICKG), 621-628, 2020 | 12 | 2020 |
Muse: Mamba is efficient multi-scale learner for text-video retrieval H Tang, M Cao, J Huang, R Liu, P Jin, G Li, X Liang arXiv preprint arXiv:2408.10575, 2024 | 8 | 2024 |
Physgame: Uncovering physical commonsense violations in gameplay videos M Cao, H Tang, H Zhao, H Guo, J Liu, G Zhang, R Liu, Q Sun, I Reid, ... arXiv preprint arXiv:2412.01800, 2024 | 4 | 2024 |
Ppllava: Varied video sequence understanding with prompt guidance R Liu, H Tang, H Liu, Y Ge, Y Shan, C Li, J Yang arXiv preprint arXiv:2411.02327, 2024 | 4 | 2024 |
Characterizing robotic and organic query in sparql search sessions X Zhang, M Wang, B Zhao, R Liu, J Zhang, H Yang Web and Big Data: 4th International Joint Conference, APWeb-WAIM 2020 …, 2020 | 2 | 2020 |