Sledovat
Minghe Gao
Minghe Gao
E-mailová adresa ověřena na: zju.edu.cn
Název
Citace
Citace
Rok
Fine-tuning multimodal llms to follow zero-shot demonstrative instructions
J Li, K Pan, Z Ge, M Gao, W Ji, W Zhang, TS Chua, S Tang, H Zhang, ...
The Twelfth International Conference on Learning Representations, 2023
702023
Gradient-regulated meta-prompt learning for generalizable vision-language models
J Li, M Gao, L Wei, S Tang, W Zhang, M Li, W Ji, Q Tian, TS Chua, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
262023
Empowering vision-language models to follow interleaved vision-language instructions
J Li, K Pan, Z Ge, M Gao, H Zhang, W Ji, W Zhang, TS Chua, S Tang, ...
arXiv preprint arXiv:2308.04152, 2023
192023
De-fine: Decomposing and Refining Visual Programs with Auto-Feedback
M Gao, J Li, H Fei, L Pang, W Ji, G Wang, Z Lv, W Zhang, S Tang, ...
Proceedings of the 32nd ACM International Conference on Multimedia, 7649-7657, 2024
92024
Fact :Teaching MLLMs with Faithful, Concise and Transferable Rationales
M Gao, S Chen, L Pang, Y Yao, J Dang, W Zhang, J Li, S Tang, Y Zhuang, ...
Proceedings of the 32nd ACM International Conference on Multimedia, 846-855, 2024
42024
Generalist virtual agents: A survey on autonomous agents across digital platforms
M Gao, W Bu, B Miao, Y Wu, Y Li, J Li, S Tang, Q Wu, Y Zhuang, M Wang
arXiv preprint arXiv:2411.10943, 2024
32024
Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining
Z Ge, J Li, X Pang, M Gao, K Pan, W Lin, H Fei, W Zhang, S Tang, ...
arXiv preprint arXiv:2412.10342, 2024
12024
STEP: Enhancing Video-LLMs' Compositional Reasoning by Spatio-Temporal Graph-guided Self-Training
H Qiu, M Gao, L Qian, K Pan, Q Yu, J Li, W Wang, S Tang, Y Zhuang, ...
arXiv preprint arXiv:2412.00161, 2024
2024
Systém momentálně nemůže danou operaci provést. Zkuste to znovu později.
Články 1–8