GroundingGPT:Language Enhanced Multi-modal Grounding Model Z Li, Q Xu, D Zhang, H Song, Y Cai, Q Qi, R Zhou, J Pan, Z Li, VT Vu, ... ACL 2024, 2024 | 40* | 2024 |
Leveraging intra-domain knowledge to strengthen cross-domain crowd counting Y Cai, L Chen, Z Ma, C Lu, C Wang, G He ICME 2021, Oral, 2021 | 8 | 2021 |
Unifiedmllm: Enabling unified representation for multi-modal multi-tasks with large language model Z Li, W Wang, Y Cai, X Qi, P Wang, D Zhang, H Song, B Jiang, Z Huang, ... NAACL 2025, 2024 | 6 | 2024 |
Multi-Prototype Space Learning for Commonsense-Based Scene Graph Generation L Chen, Y Song, Y Cai, J Lu, Y Li, Y Xie, C Wang, G He AAAI 2024, 2024 | 3 | 2024 |
Explicit invariant feature induced cross-domain crowd counting Y Cai, L Chen, H Guan, S Lin, C Lu, C Wang, G He AAAI 2023, 2023 | 3 | 2023 |
Exploring contextual relationships in 3d cloud points by semantic knowledge mining L Chen, J Lu, Y Cai, C Wang, G He Computer Graphics Forum 41 (7), 75-86, 2022 | 3 | 2022 |
Dh-gcn: Saliency-aware complex scene graph generation using dual-hierarchy graph convolutional network J Lu, L Chen, Y Cai, H Guan, C Lu, C Wang, G He ICME 2022, Oral, 2022 | 2 | 2022 |
Global Representation Guided Adaptive Fusion Network for Stable Video Crowd Counting Y Cai, Z Ma, C Lu, C Wang, G He TMM 2022, 2022 | 2 | 2022 |
Advancing Fine-Grained Visual Understanding with Multi-Scale Alignment in Multi-Modal Models W Wang, Z Li, Q Xu, L Li, YQ Cai, B Jiang, H Song, X Hu, P Wang, L Xiao arXiv preprint arXiv:2411.09691, 2024 | 1 | 2024 |
QCRD: Quality-guided Contrastive Rationale Distillation for Large Language Models W Wang, Z Li, Q Xu, Y Cai, H Song, Q Qi, R Zhou, Z Huang, T Wang, ... arXiv preprint arXiv:2405.13014, 2024 | 1 | 2024 |
Video-based spatio-temporal scene graph generation with efficient self-supervision tasks L Chen, Y Cai, C Lu, C Wang, G He Multimedia Tools and Applications 82 (25), 38947-38966, 2023 | 1 | 2023 |
Exploring Contextual Relationships in 3D Cloud Points by Semantic Knowledge Mining: Supplementary Material L Chen, J Lu, Y Cai, C Wang, G He | | 2022 |