Zongyang Ma

Cited by

	All	Since 2020
Citations	143	143
h-index	5	5
i10-index	4	4

Citations per year: 2022: 2, 2023: 37, 2024: 91, 2025: 13

Public access

5 articles

1 article

available

not available

Based on funding mandates

Co-authors

Weiming Hu; NLPR; Verified email at nlpr.ia.ac.cn
Yuxin Chen; National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences; Verified email at ia.ac.cn
Zhongang Qi (祁仲昂); Principal Researcher, ARC Lab, Tencent PCG; Verified email at tencent.com
Ying Shan; Distinguished Scientist at Tencent, Director of ARC Lab & AI Lab CVC; Verified email at tencent.com
Bing Li; Professor of National Laboratory of Pattern Recognition, Institute of Automation, Chinese; Verified email at nlpr.ia.ac.cn
Chunfeng Yuan; National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences; Verified email at nlpr.ia.ac.cn
Ziqi Zhang; Ph.D. of Institute of automation, Chinese Academy of Sciences; Verified email at ia.ac.cn

Zongyang Ma

MAIS & NLPR, Institute of Automation, Chinese Academy of Sciences.

Verified email at ia.ac.cn

MLLM, Vision and Language


Title	Cited by	Year
Open-vocabulary one-stage detection with hierarchical visual-language knowledge distillation. Z Ma, G Luo, J Gao, L Li, Y Chen, S Wang, C Zhang, W Hu. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	94	2022
Large Language Models Meet Text-Centric Multimodal Sentiment Analysis: A Survey. H Yang, Y Zhao, Y Wu, S Wang, T Zheng, H Zhang, Z Ma, W Che, B Qin. arXiv preprint arXiv:2406.08068, 2024	13	2024
ViLEM: Visual-Language Error Modeling for Image-Text Retrieval. Y Chen, Z Ma, Z Zhang*, Z Qi, C Yuan, Y Shan, B Li, W Hu, X Qie, J Wu. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	11	2023
Chinese Title Generation for Short Videos: Dataset, Metric and Algorithm. Z Zhang, Z Ma, C Yuan, Y Chen, P Wang, Z Qi, C Hao, B Li, Y Shan, ... IEEE Transactions on Pattern Analysis & Machine Intelligence, 1-16, 2024	10*	2024
Et bench: Towards open-ended event-level video-language understanding. Y Liu, Z Ma, Z Qi, Y Wu, Y Shan, CW Chen. arXiv preprint arXiv:2409.18111, 2024	5	2024
Learning semantics-grounded vocabulary representation for video-text retrieval. Y Shi, H Liu, H Xu, Z Ma, Q Ye, A Hu, M Yan, J Zhang, F Huang, C Yuan, ... Proceedings of the 31st ACM International Conference on Multimedia, 4460-4470, 2023	5	2023
Order-Prompted Tag Sequence Generation for Video Tagging. Z Ma, Z Zhang, Y Chen, Z Qi, Y Luo, Z Li, C Yuan, B Li, X Qie, Y Shan, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	3	2023
mRAG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA. T Zhang, Z Zhang, Z Ma, Y Chen, Z Qi, C Yuan, B Li, J Pu, Y Zhao, Z Xie, ... arXiv preprint arXiv:2411.15041, 2024	1	2024
How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval? Y Chen, Z Ma, Z Zhang*, Z Qi, C Yuan, B Li, J Pu, Y Shan, X Qi, W Hu. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	1	2024
EA-VTR: Event-Aware Video-Text Retrieval. Z Ma, Z Zhang, Y Chen, Z Qi, C Yuan, B Li, Y Luo, X Li, X Qi, Y Shan, ... European Conference on Computer Vision, 76-94, 2024		2024
