Relation-enhanced negative sampling for multimodal knowledge graph completion D Xu, T Xu, S Wu, J Zhou, E Chen Proceedings of the 30th ACM international conference on multimedia, 3857-3866, 2022 | 38 | 2022 |
VideoLLM-online: Online video large language model for streaming video J Chen, Z Lv, S Wu, KQ Lin, C Song, D Gao, JW Liu, Z Gao, D Mao, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 26 | 2024 |
Linking the characters: Video-oriented social graph generation via hierarchical-cumulative GCN S Wu, J Chen, T Xu, L Chen, L Wu, Y Hu, E Chen Proceedings of the 29th ACM International Conference on Multimedia, 4716-4724, 2021 | 26 | 2021 |
NoteLLM: A retrievable large language model for note recommendation C Zhang, S Wu, H Zhang, T Xu, Y Gao, Y Hu, E Chen Companion Proceedings of the ACM Web Conference 2024, 170-179, 2024 | 23 | 2024 |
Multi-grained multimodal interaction network for entity linking P Luo, T Xu, S Wu, C Zhu, L Xu, E Chen Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023 | 15 | 2023 |
Is heuristic sampling necessary in training deep object detectors? J Chen, D Liu, T Xu, S Wu, Y Cheng, E Chen IEEE Transactions on Image Processing 30, 8454-8467, 2021 | 14 | 2021 |
TOMGPT: reliable text-only training approach for cost-effective multi-modal large language model Y Chen, Q Wang, S Wu, Y Gao, T Xu, Y Hu ACM Transactions on Knowledge Discovery from Data 18 (7), 1-19, 2024 | 13 | 2024 |
NoteLLM-2: multimodal large representation models for recommendation C Zhang, H Zhang, S Wu, D Wu, T Xu, Y Gao, Y Hu, E Chen arXiv preprint arXiv:2405.16789, 2024 | 10 | 2024 |
AU-aware graph convolutional network for Macro- and Micro-expression spotting S Yin, S Wu, T Xu, S Liu, S Zhao, E Chen 2023 IEEE International Conference on Multimedia and Expo (ICME), 228-233, 2023 | 10 | 2023 |
Unified QA-aware knowledge graph generation based on multi-modal modeling P Qin, J Yu, Y Gao, D Xu, Y Chen, S Wu, T Xu, E Chen, Y Hao Proceedings of the 30th ACM International Conference on Multimedia, 7185-7189, 2022 | 10 | 2022 |
Is sampling heuristics necessary in training deep object detectors? J Chen, D Liu, T Xu, S Zhang, S Wu, B Luo arXiv preprint arXiv:1909.04868, 2019 | 10 | 2019 |
When I fall in love: Capturing video-oriented social relationship evolution via attentive GNN P Qin, S Wu, T Xu, Y Hao, F Feng, C Zhu, E Chen IEEE Transactions on Circuits and Systems for Video Technology 34 (6), 5160-5175, 2023 | 4 | 2023 |
VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation S Wu, J Chen, KQ Lin, Q Wang, Y Gao, Q Xu, T Xu, Y Hu, E Chen, ... Advances in Neural Information Processing Systems, 2024 | 3 | 2024 |
Comprehending the gossips: Meme explanation in time-sync video comment via multimodal cues Z Xie, W He, T Xu, S Wu, C Zhu, P Yang, E Chen ACM Transactions on Asian and Low-Resource Language Information Processing …, 2023 | 3 | 2023 |
Winning the CVPR'2022 AQTC Challenge: A Two-stage Function-centric Approach S Wu, W He, T Xu, H Wang, E Chen CVPR Workshop, 2022 | 3 | 2022 |
ShowUI: One vision-language-action model for GUI visual agent KQ Lin, L Li, D Gao, Z Yang, S Wu, Z Bai, W Lei, L Wang, MZ Shou arXiv preprint arXiv:2411.17465, 2024 | 2 | 2024 |
AU-aware graph convolutional network for Macro- and Micro-expression spotting S Yin, S Wu, T Xu, S Liu, S Zhao, E Chen arXiv preprint arXiv:2303.09114, 2023 | 2 | 2023 |
From a social cognitive perspective: Context-aware visual social relationship recognition S Wu, C Zhang, J Chen, T Xu, L Wu, Y Hu, E Chen arXiv preprint arXiv:2406.08358, 2024 | 1 | 2024 |
Communication-Efficient Distributed Learning with Local Immediate Error Compensation Y Cheng, L Shen, L Xu, X Qian, S Wu, Y Zhou, T Zhang, D Tao, E Chen arXiv preprint arXiv:2402.11857, 2024 | 1 | 2024 |
SGAT: scene graph attention network for video recommendation X Wang, T Xu, S Wu Proceedings of the 2023 5th International Conference on Image, Video and …, 2023 | 1 | 2023 |