Sen Xing
Verified email at mails.tsinghua.edu.cn
Title
Cited by
Year
Internvl: Scaling up vision foundation models and aligning for generic visual-linguistic tasks
Z Chen, J Wu, W Wang, W Su, G Chen, S Xing, M Zhong, Q Zhang, X Zhu, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by 617*, 2024
Internvideo: General video foundation models via generative and discriminative learning
Y Wang, K Li, Y Li, Y He, B Huang, Z Zhao, H Zhang, J Xu, Y Liu, Z Wang, ...
arXiv preprint arXiv:2212.03191, 2022
Cited by 339, 2022
Internvideo-ego4d: A pack of champion solutions to ego4d challenges
G Chen, S Xing, Z Chen, Y Wang, K Li, Y Li, Y Liu, J Wang, YD Zheng, ...
arXiv preprint arXiv:2211.09529, 2022
Cited by 44, 2022
Visionllm v2: An end-to-end generalist multimodal large language model for hundreds of vision-language tasks
J Wu, M Zhong, S Xing, Z Lai, Z Liu, Z Chen, W Wang, X Zhu, L Lu, T Lu, ...
Advances in Neural Information Processing Systems 37, 69925-69975, 2025
Cited by 33, 2025
Asymmetric masked distillation for pre-training small foundation models
Z Zhao, B Huang, S Xing, G Wu, Y Qiao, L Wang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by 8, 2024
Mulan: Adapting multilingual diffusion models for hundreds of languages with negligible cost
S Xing, M Zhong, Z Lai, L Li, J Liu, Y Wang, J Dai, W Wang
arXiv preprint arXiv:2412.01271, 2024
Cited by 1, 2024