Shiwei Wu

Citeret af

	Alle	Siden 2020
Henvisninger	217	217
h-index	10	10
i10-indeks	11	11

140

105

2020202120222023202420255 4 13 35 126 34

Offentlig adgang

Se alle

5 artikler

tilgængelige

ikke tilgængelige

Baseret på krav i forbindelse med finansiering

Medforfattere

Tong XuProfessor, University of Science and Technology of ChinaVerificeret mail på ustc.edu.cn
Enhong ChenUniversity of Science and Technology of ChinaVerificeret mail på ustc.edu.cn
Joya ChenNational University of SingaporeVerificeret mail på u.nus.edu
Mike Z. SHOUNational U. of Singapore; Facebook AI; Columbia UniversityVerificeret mail på columbia.edu
Kevin Qinghong LinNational University of SingaporeVerificeret mail på u.nus.edu
Liyi ChenUniversity of Science and Technology of ChinaVerificeret mail på mail.ustc.edu.cn

Følg

Shiwei Wu

University of Science and Technology of China

Verificeret mail på mail.ustc.edu.cn

VideoLLM Movie understanding Computer Vision in Social Science


Titel Sortér efter henvisninger Sortér efter årstal Sortér efter titel	Citeret af Citeret af	År
Relation-enhanced negative sampling for multimodal knowledge graph completion D Xu, T Xu, S Wu, J Zhou, E Chen Proceedings of the 30th ACM international conference on multimedia, 3857-3866, 2022	38	2022
Videollm-online: Online video large language model for streaming video J Chen, Z Lv, S Wu, KQ Lin, C Song, D Gao, JW Liu, Z Gao, D Mao, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	26	2024
Linking the characters: Video-oriented social graph generation via hierarchical-cumulative GCN S Wu, J Chen, T Xu, L Chen, L Wu, Y Hu, E Chen Proceedings of the 29th ACM International Conference on Multimedia, 4716-4724, 2021	26	2021
Notellm: A retrievable large language model for note recommendation C Zhang, S Wu, H Zhang, T Xu, Y Gao, Y Hu, E Chen Companion Proceedings of the ACM Web Conference 2024, 170-179, 2024	23	2024
Multi-grained multimodal interaction network for entity linking P Luo, T Xu, S Wu, C Zhu, L Xu, E Chen Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023	15	2023
Is heuristic sampling necessary in training deep object detectors? J Chen, D Liu, T Xu, S Wu, Y Cheng, E Chen IEEE Transactions on Image Processing 30, 8454-8467, 2021	14	2021
TOMGPT: reliable text-only training approach for cost-effective multi-modal large language model Y Chen, Q Wang, S Wu, Y Gao, T Xu, Y Hu ACM Transactions on Knowledge Discovery from Data 18 (7), 1-19, 2024	13	2024
NoteLLM-2: multimodal large representation models for recommendation C Zhang, H Zhang, S Wu, D Wu, T Xu, Y Gao, Y Hu, E Chen arXiv preprint arXiv:2405.16789, 2024	10	2024
AU-aware graph convolutional network for Macroand Micro-expression spotting S Yin, S Wu, T Xu, S Liu, S Zhao, E Chen 2023 IEEE International Conference on Multimedia and Expo (ICME), 228-233, 2023	10	2023
Unified QA-aware knowledge graph generation based on multi-modal modeling P Qin, J Yu, Y Gao, D Xu, Y Chen, S Wu, T Xu, E Chen, Y Hao Proceedings of the 30th ACM International Conference on Multimedia, 7185-7189, 2022	10	2022
Is sampling heuristics necessary in training deep object detectors? J Chen, D Liu, T Xu, S Zhang, S Wu, B Luo arXiv preprint arXiv:1909.04868, 2019	10	2019
When I fall in love: Capturing video-oriented social relationship evolution via attentive GNN P Qin, S Wu, T Xu, Y Hao, F Feng, C Zhu, E Chen IEEE Transactions on Circuits and Systems for Video Technology 34 (6), 5160-5175, 2023	4	2023
VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation S Wu, J Chen, KQ Lin, Q Wang, Y Gao, Q Xu, T Xu, Y Hu, E Chen, ... Advances in Neural Information Processing Systems, 2024	3	2024
Comprehending the gossips: Meme explanation in time-sync video comment via multimodal cues Z Xie, W He, T Xu, S Wu, C Zhu, P Yang, E Chen ACM Transactions on Asian and Low-Resource Language Information Processing …, 2023	3	2023
Winning the CVPR'2022 AQTC Challenge: A Two-stage Function-centric Approach S Wu, W He, T Xu, H Wang, E Chen CVPR Workshop, 2022	3	2022
Showui: One vision-language-action model for gui visual agent KQ Lin, L Li, D Gao, Z Yang, S Wu, Z Bai, W Lei, L Wang, MZ Shou arXiv preprint arXiv:2411.17465, 2024	2	2024
AU-aware graph convolutional network for Macro-and Micro-expression spotting S Yin, S Wu, T Xu, S Liu, S Zhao, E Chen arXiv preprint arXiv:2303.09114, 2023	2	2023
From a social cognitive perspective: Context-aware visual social relationship recognition S Wu, C Zhang, J Chen, T Xu, L Wu, Y Hu, E Chen arXiv preprint arXiv:2406.08358, 2024	1	2024
Communication-Efficient Distributed Learning with Local Immediate Error Compensation Y Cheng, L Shen, L Xu, X Qian, S Wu, Y Zhou, T Zhang, D Tao, E Chen arXiv preprint arXiv:2402.11857, 2024	1	2024
SGAT: scene graph attention network for video recommendation X Wang, T Xu, S Wu Proceedings of the 2023 5th International Conference on Image, Video and …, 2023	1	2023

Systemet kan ikke foretage handlingen nu. Prøv igen senere.

Artikler 1–20

Henvisninger pr. år

Dublerede henvisninger

Flettede henvisninger

Tilføj medforfattereMedforfattere

Følg

Citeret af

Medforfattere