Multimodal analysis for deep video understanding with video language transformer B Zhang, Y Fang, T Ren, G Wu Proceedings of the 30th ACM International Conference on Multimedia, 7165-7169, 2022 | 5 | 2022 |
Substation meter detection and recognition method based on lightweight deep learning model W Yang, W Luo, J Mao, Y Fang, J Bei International Symposium on Artificial Intelligence and Robotics 2022 12508 …, 2022 | 4 | 2022 |
Hybrid improvements in multimodal analysis for deep video understanding B Zhang, F Yu, Y Fang, T Ren, G Wu Proceedings of the 3rd ACM International Conference on Multimedia in Asia, 1-5, 2021 | 3 | 2021 |
Deep Video Understanding with Video-Language Model R Liu, Y Fang, F Yu, R Tian, T Ren, G Wu Proceedings of the 31st ACM International Conference on Multimedia, 9551-9555, 2023 | 2 | 2023 |
Semantic-guided RGB-Thermal Crowd Counting with Segment Anything Model Y Fang, Y Shi, J Bei, T Ren Proceedings of the 2024 International Conference on Multimedia Retrieval …, 2024 | 1 | 2024 |
MMSF: A multimodal sentiment-fused method to recognize video speaking style B Zhang, Y Fang, F Yu, J Bei, T Ren Proceedings of the 2023 ACM International Conference on Multimedia Retrieval …, 2023 | 1 | 2023 |
CAGNet: a context-aware graph neural network for detecting social relationships in videos F Yu, Y Fang, Z Zhao, J Bei, T Ren, G Wu Visual Intelligence 2 (1), 22, 2024 | | 2024 |
Reproducibility Companion Paper of "MMSF: A Multimodal Sentiment-Fused Method to Recognize Video Speaking Style" F Yu, B Zhang, Y Fang, J Bei, T Ren, J Li, L Rossetto Proceedings of the 2024 International Conference on Multimedia Retrieval …, 2024 | | 2024 |
ADNet: An Asymmetric Dual-Stream Network for RGB-T Salient Object Detection Y Fang, R Hou, J Bei, T Ren, G Wu Proceedings of the 5th ACM International Conference on Multimedia in Asia, 1-7, 2023 | | 2023 |