Yining Li

Sitert av

	Alle	Siden 2020
Sitater	3664	3370
h-indeks	18	17
i10-indeks	19	19

1500

750

375

1125

20172018201920202021202220232024202541 76 169 253 422 451 552 1420 271

Offentlig tilgang

Vis alle

5 artikler

0 artikler

tilgjengelige

ikke tilgjengelige

Basert på finansieringsmandater

Medforfattere

Chen Change LoyPresident's Chair Professor, MMLab@NTU, S-Lab, Nanyang Technological UniversityVerifisert e-postadresse på ntu.edu.sg
Kai ChenShanghai AI LaboratoryVerifisert e-postadresse på pjlab.org.cn
Xiaoou TangThe Chinese University of Hong KongVerifisert e-postadresse på ie.cuhk.edu.hk
Xiangtai LiResearch Scientist, Bytedance Seed, SG; Nanyang Technological UniversityVerifisert e-postadresse på pku.edu.cn
Tao JiangShanghai AI LaboratoryVerifisert e-postadresse på pjlab.org.cn
Peng LuTsinghua UniversityVerifisert e-postadresse på mails.tsinghua.edu.cn
Chengqi LyuShanghai AI LaboratoryVerifisert e-postadresse på pjlab.org.cn
Chen HuangResearch Scientist, Apple IncVerifisert e-postadresse på apple.com

Følg

Yining Li

Shanghai AI Laboratory

Verifisert e-postadresse på pjlab.org.cn - Startside

Multimodal Learning Large Language Model


Tittel Sorter etter sitater Sorter etter år Sorter etter tittel	Sitert av Sitert av	År
Learning deep representation for imbalanced classification C Huang, Y Li, CC Loy, X Tang Proceedings of the IEEE conference on computer vision and pattern …, 2016	1314	2016
Deep imbalanced learning for face recognition and attribute prediction C Huang, Y Li, CC Loy, X Tang IEEE transactions on pattern analysis and machine intelligence 42 (11), 2781 …, 2019	407	2019
Openmmlab pose estimation toolbox and benchmark MMP Contributors	382	2020
Internlm2 technical report Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ... arXiv preprint arXiv:2403.17297, 2024	246	2024
Internlm-xcomposer2: Mastering free-form text-image composition and comprehension in vision-language large model X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ... arXiv preprint arXiv:2401.16420, 2024	232	2024
Dense intrinsic appearance flow for human pose transfer Y Li, C Huang, CC Loy Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019	220	2019
Human attribute recognition by deep hierarchical contexts Y Li, C Huang, CC Loy, X Tang Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016	207	2016
Rtmpose: Real-time multi-person pose estimation based on mmpose T Jiang, P Lu, L Zhang, N Ma, R Han, C Lyu, Y Li, K Chen arXiv preprint arXiv:2303.07399, 2023	178	2023
Internlm-xcomposer2-4khd: A pioneering large vision-language model handling resolutions from 336 pixels to 4k hd X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, S Zhang, H Duan, ... Advances in Neural Information Processing Systems 37, 42566-42592, 2025	111	2025
Internlm-xcomposer-2.5: A versatile large vision language model supporting long-contextual input and output P Zhang, X Dong, Y Zang, Y Cao, R Qian, L Chen, Q Guo, H Duan, ... arXiv preprint arXiv:2407.03320, 2024	81	2024
Omg-seg: Is one model good enough for all segmentation? X Li, H Yuan, W Li, H Ding, S Wu, W Zhang, Y Li, K Chen, CC Loy Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2024	49	2024
Open-vocabulary SAM: Segment and recognize twenty-thousand classes interactively H Yuan, X Li, C Zhou, Y Li, K Chen, CC Loy European Conference on Computer Vision, 419-437, 2024	38	2024
RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation P Lu, T Jiang, Y Li, X Li, K Chen, W Yang Proceedings of the IEEE conference on computer vision and pattern recognition, 2024	33	2024
Mmbench-video: A long-form multi-shot benchmark for holistic video understanding X Fang, K Mao, H Duan, X Zhao, Y Li, D Lin, K Chen Advances in Neural Information Processing Systems 37, 89098-89124, 2025	32	2025
Learning to disambiguate by asking discriminative questions Y Li, C Huang, X Tang, C Change Loy Proceedings of the IEEE International Conference on Computer Vision, 3419-3428, 2017	30	2017
An open and comprehensive pipeline for unified object grounding and detection X Zhao, Y Chen, S Xu, X Li, X Wang, Y Li, H Huang arXiv preprint arXiv:2401.02361, 2024	25	2024
Towards language-driven video inpainting via multimodal large language models J Wu, X Li, C Si, S Zhou, J Yang, J Zhang, Y Li, K Chen, Y Tong, Z Liu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	20	2024
Motionbooth: Motion-aware customized text-to-video generation J Wu, X Li, Y Zeng, J Zhang, Q Zhou, Y Li, Y Tong, K Chen arXiv preprint arXiv:2406.17758, 2024	18	2024
Dst-det: Simple dynamic self-training for open-vocabulary object detection S Xu, X Li, S Wu, W Zhang, Y Tong, CC Loy arXiv preprint arXiv:2310.01393, 2023	11	2023
Mg-llava: Towards multi-granularity visual instruction tuning X Zhao, X Li, H Duan, H Huang, Y Li, K Chen, H Yang arXiv preprint arXiv:2406.17770, 2024	8	2024

Systemet kan ikke utføre handlingen. Prøv på nytt senere.

Artikler 1–20

Sitater per år

Duplikatsitater

Sammenslåtte sitater

Legg til medforfattereMedforfattere

Følg

Sitert av

Medforfattere