Zijia Zhao

Citado por

	Total	Desde 2020
Citas	329	329
Índice h	8	8
Índice i10	6	6

240

120

180

202120222023202420251 9 53 226 40

Acceso público

Ver todo

3 artículos

0 artículos

disponibles

no disponibles

Basado en requisitos de financiación

Coautores

Jing Liu 刘静Professor in Institute of Automation of the Chinese Academy Sciences (CASIA)Dirección de correo verificada de nlpr.ia.ac.cn
Longteng GuoAssociate Professor, Institute of Automation of the Chinese Academy Sciences (CASIA)Dirección de correo verificada de nlpr.ia.ac.cn
Tongtian YueInstitute of Automation, Chinese Academy of SciencesDirección de correo verificada de ia.ac.cn
Shuai ShaoTencentDirección de correo verificada de tencent.com
Sihan ChenInstitute of Automation, Chinese Academy of SciencesDirección de correo verificada de nlpr.ia.ac.cn
Haoyu LuRenmin University of ChinaDirección de correo verificada de ruc.edu.cn
Yuqi HuoBaichuan Inc.Dirección de correo verificada de baichuan-inc.com

Seguir

Zijia Zhao

Institute of Automation, Chinese Academy Sciences (CASIA)

Dirección de correo verificada de ia.ac.cn

Multimodal learning


Título Ordenar por citas Ordenar por año Ordenar por título	Citado por Citado por	Año
Vast: A vision-audio-subtitle-text omni-modality foundation model and dataset S Chen, H Li, Q Wang, Z Zhao, M Sun, X Zhu, J Liu Advances in Neural Information Processing Systems 36, 72842-72866, 2023	110	2023
Vl-mamba: Exploring state space models for multimodal learning Y Qiao, Z Yu, L Guo, S Chen, Z Zhao, M Sun, Q Wu, J Liu arXiv preprint arXiv:2403.13600, 2024	63	2024
Chatbridge: Bridging modalities with large language model as a language catalyst Z Zhao, L Guo, T Yue, S Chen, S Shao, X Zhu, Z Yuan, J Liu arXiv preprint arXiv:2305.16103, 2023	54	2023
Opt: Omni-perception pre-trainer for cross-modal understanding and generation J Liu, X Zhu, F Liu, L Guo, Z Zhao, M Sun, W Wang, H Lu, S Zhou, J Zhang, ... arXiv preprint arXiv:2107.00249, 2021	47	2021
Mamo: Fine-grained vision-language representations learning with masked multimodal modeling Z Zhao, L Guo, X He, S Shao, Z Yuan, J Liu Proceedings of the 46th International ACM SIGIR Conference on Research and …, 2023	16*	2023
Sc-tune: Unleashing self-consistent referential comprehension in large vision language models T Yue, J Cheng, L Guo, X Dai, Z Zhao, X He, G Xiong, Y Lv, J Liu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	10	2024
Needle in a video haystack: A scalable synthetic framework for benchmarking video mllms Z Zhao, H Lu, Y Huo, Y Du, T Yue, L Guo, B Wang, W Chen, J Liu arXiv e-prints, arXiv: 2406.09367, 2024	9	2024
Mm21 pre-training for video understanding challenge: Video captioning with pretraining techniques S Chen, X Zhu, D Hao, W Liu, J Liu, Z Zhao, L Guo, J Liu Proceedings of the 29th ACM International Conference on Multimedia, 4853-4857, 2021	8	2021
Towards event-oriented long video understanding Y Du, K Zhou, Y Huo, Y Li, WX Zhao, H Lu, Z Zhao, B Wang, W Chen, ... arXiv preprint arXiv:2406.14129, 2024	7	2024
Beyond literal descriptions: understanding and locating open-world objects aligned with human intentions W Wang, Y Zhang, X He, Y Yan, Z Zhao, X Wang, J Liu arXiv preprint arXiv:2402.11265, 2024	2	2024
ChatSearch: a Dataset and a Generative Retrieval Model for General Conversational Image Retrieval Z Zhao, L Guo, T Yue, E Hu, S Shao, Z Yuan, H Huang, J Liu arXiv preprint arXiv:2410.18715, 2024	1	2024
Exploring the design space of visual context representation in video mllms Y Du, Y Huo, K Zhou, Z Zhao, H Lu, H Huang, WX Zhao, B Wang, W Chen, ... arXiv preprint arXiv:2410.13694, 2024	1	2024
OneDiff: A Generalist Model for Image Difference Captioning E Hu, L Guo, T Yue, Z Zhao, S Xue, J Liu Proceedings of the Asian Conference on Computer Vision, 2439-2455, 2024	1	2024
Collaborative Training of Tiny-Large Vision Language Models S Lu, L Guo, W Wang, Z Zhao, T Yue, J Liu, S Liu Proceedings of the 32nd ACM International Conference on Multimedia, 4928-4937, 2024		2024
Beyond Filtering: Adaptive Image-Text Quality Enhancement for MLLM Pretraining H Huang, Y Huo, Z Zhao, H Lu, S Wu, B Wang, Q Liu, W Chen, L Wang arXiv preprint arXiv:2410.16166, 2024		2024

El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.

Artículos 1–15

Citas por año

Citas duplicadas

Citas combinadas

Añadir coautoresCoautores

Seguir

Citado por

Coautores