Follow
Zongyang Ma
Zongyang Ma
Verified email at ia.ac.cn
Title
Cited by
Cited by
Year
Open-vocabulary one-stage detection with hierarchical visual-language knowledge distillation
Z Ma, G Luo, J Gao, L Li, Y Chen, S Wang, C Zhang, W Hu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
942022
Large Language Models Meet Text-Centric Multimodal Sentiment Analysis: A Survey
H Yang, Y Zhao, Y Wu, S Wang, T Zheng, H Zhang, Z Ma, W Che, B Qin
arXiv preprint arXiv:2406.08068, 2024
132024
ViLEM: Visual-Language Error Modeling for Image-Text Retrieval
Y Chen*, Z Ma*, Z Zhang*, Z Qi, C Yuan, Y Shan, B Li, W Hu, X Qie, J Wu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
112023
Chinese Title Generation for Short Videos: Dataset, Metric and Algorithm
Z Zhang*, Z Ma*, C Yuan, Y Chen, P Wang, Z Qi, C Hao, B Li, Y Shan, ...
IEEE Transactions on Pattern Analysis & Machine Intelligence, 1-16, 2024
10*2024
Et bench: Towards open-ended event-level video-language understanding
Y Liu, Z Ma, Z Qi, Y Wu, Y Shan, CW Chen
arXiv preprint arXiv:2409.18111, 2024
52024
Learning semantics-grounded vocabulary representation for video-text retrieval
Y Shi, H Liu, H Xu, Z Ma, Q Ye, A Hu, M Yan, J Zhang, F Huang, C Yuan, ...
Proceedings of the 31st ACM International Conference on Multimedia, 4460-4470, 2023
52023
Order-Prompted Tag Sequence Generation for Video Tagging
Z Ma, Z Zhang, Y Chen, Z Qi, Y Luo, Z Li, C Yuan, B Li, X Qie, Y Shan, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
32023
mRAG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA
T Zhang, Z Zhang, Z Ma, Y Chen, Z Qi, C Yuan, B Li, J Pu, Y Zhao, Z Xie, ...
arXiv preprint arXiv:2411.15041, 2024
12024
How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?
Y Chen*, Z Ma*, Z Zhang*, Z Qi, C Yuan, B Li, J Pu, Y Shan, X Qi, W Hu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
12024
EA-VTR: Event-Aware Video-Text Retrieval
Z Ma, Z Zhang, Y Chen, Z Qi, C Yuan, B Li, Y Luo, X Li, X Qi, Y Shan, ...
European Conference on Computer Vision, 76-94, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–10