Shaoxiang Chen
Meituan
Verified email at fudan.edu.cn
Title
Cited by
Year
Semantic proposal for activity localization in videos via sentence query
S Chen, YG Jiang
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 8199-8206, 2019
Cited by: 210; Year: 2019
Black-box adversarial attacks on video recognition models
L Jiang, X Ma, S Chen, J Bailey, YG Jiang
Proceedings of the 27th ACM International Conference on Multimedia, 864-872, 2019
Cited by: 167; Year: 2019
Motion guided spatial attention for video captioning
S Chen, YG Jiang
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 8191-8198, 2019
Cited by: 154; Year: 2019
Learning modality interaction for temporal sentence localization and event captioning in videos
S Chen, W Jiang, W Liu, YG Jiang
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
Cited by: 111; Year: 2020
MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection
Y Jiao, Z Jie, S Chen, J Chen, X Wei, L Ma, YG Jiang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 100; Year: 2023
Towards Bridging Event Captioner and Sentence Localizer for Weakly Supervised Dense Event Captioning
S Chen, YG Jiang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Cited by: 82; Year: 2021
Motion Guided Region Message Passing for Video Captioning
S Chen, YG Jiang
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
Cited by: 74; Year: 2021
Deep Learning for Video Captioning: A Review
S Chen, T Yao, YG Jiang
Proceedings of the 28th International Joint Conference on Artificial …, 2019
Cited by: 62; Year: 2019
Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language
S Chen, YG Jiang
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
Cited by: 59; Year: 2020
MORE: Multi-order relation mining for dense captioning in 3D scenes
Y Jiao, S Chen, Z Jie, J Chen, L Ma, YG Jiang
Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022
Cited by: 45; Year: 2022
Non-local NetVLAD encoding for video classification
Y Tang, X Zhang, L Ma, J Wang, S Chen, YG Jiang
The 2nd Workshop on YouTube-8M Large-Scale Video Understanding (ECCV'18), 2018
Cited by: 45; Year: 2018
LLaVA-MoLE: Sparse mixture of LoRA experts for mitigating data conflicts in instruction finetuning MLLMs
S Chen, Z Jie, L Ma
arXiv preprint arXiv:2401.16160, 2024
Cited by: 43; Year: 2024
Scene graph refinement network for visual question answering
T Qian, J Chen, S Chen, B Wu, YG Jiang
IEEE Transactions on Multimedia 25, 3950-3961, 2022
Cited by: 43; Year: 2022
FT-TDR: Frequency-guided Transformer and Top-Down Refinement Network for Blind Face Inpainting
J Wang, S Chen, Z Wu, YG Jiang
IEEE Transactions on Multimedia, 2022
Cited by: 31; Year: 2022
Aggregating frame-level features for large-scale video classification
S Chen, X Wang, Y Tang, X Chen, Z Wu, YG Jiang
CVPR'17 Workshop on YouTube-8M Large-Scale Video Understanding, 2017
Cited by: 29; Year: 2017
Self-supervised learning for semi-supervised temporal language grounding
F Luo, S Chen, J Chen, Z Wu, YG Jiang
IEEE Transactions on Multimedia, 2022
Cited by: 16; Year: 2022
Instance-aware Multi-Camera 3D Object Detection with Structural Priors Mining and Self-Boosting Learning
Y Jiao, Z Jie, S Chen, L Cheng, J Chen, L Ma, YG Jiang
AAAI 2024, 2023
Cited by: 7; Year: 2023
Towards Bridging Video and Language by Caption Generation and Sentence Localization
S Chen
Proceedings of the 29th ACM International Conference on Multimedia, 2964-2968, 2021
Cited by: 7; Year: 2021
System and method for video captioning
Y Jiang, S Chen
US Patent 10,699,129, 2020
Cited by: 7; Year: 2020
Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models
Y Jiao, S Chen, Z Jie, J Chen, L Ma, YG Jiang
Advances in Neural Information Processing Systems 37 (NeurIPS 2024), 2024
Cited by: 6; Year: 2024