Yining Li

Hivatkozott rá

	Összes	2020 óta
Hivatkozások	3571	3272
h-index	17	16
i10-index	19	19

1400

700

350

1050

20172018201920202021202220232024202541 78 170 251 425 452 547 1397 197

Nyilvános hozzáférés

Összes megtekintése

6 cikk

0 cikk

elérhető

nem érhető el

Finanszírozási megbízások alapján

Társszerzők

Chen Change LoyPresident's Chair Professor, MMLab@NTU, S-Lab, Nanyang Technological UniversityE-mail megerősítve itt: ntu.edu.sg
Kai ChenShanghai AI LaboratoryE-mail megerősítve itt: pjlab.org.cn
Xiaoou TangThe Chinese University of Hong KongE-mail megerősítve itt: ie.cuhk.edu.hk
Xiangtai LiResearch Scientist, Bytedance Seed, SG; Nanyang Technological UniversityE-mail megerősítve itt: pku.edu.cn
Tao JiangShanghai AI LaboratoryE-mail megerősítve itt: pjlab.org.cn
Peng LuTsinghua UniversityE-mail megerősítve itt: mails.tsinghua.edu.cn
Chengqi LyuShanghai AI LaboratoryE-mail megerősítve itt: pjlab.org.cn
Chen HuangResearch Scientist, Apple IncE-mail megerősítve itt: apple.com

Követés

Yining Li

Shanghai AI Laboratory

E-mail megerősítve itt: pjlab.org.cn - Kezdőlap

Multimodal Learning Large Language Model


Cím Rendezés hivatkozások szerint Rendezés év szerint Rendezés cím szerint	Hivatkozott rá Hivatkozott rá	Év
Learning deep representation for imbalanced classification C Huang, Y Li, CC Loy, X Tang Proceedings of the IEEE conference on computer vision and pattern …, 2016	1316	2016
Deep imbalanced learning for face recognition and attribute prediction C Huang, Y Li, CC Loy, X Tang IEEE transactions on pattern analysis and machine intelligence 42 (11), 2781 …, 2019	407	2019
Openmmlab pose estimation toolbox and benchmark MMP Contributors	376	2020
Dense intrinsic appearance flow for human pose transfer Y Li, C Huang, CC Loy Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019	220	2019
Internlm2 technical report Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ... arXiv preprint arXiv:2403.17297, 2024	218	2024
Internlm-xcomposer2: Mastering free-form text-image composition and comprehension in vision-language large model X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ... arXiv preprint arXiv:2401.16420, 2024	216	2024
Human attribute recognition by deep hierarchical contexts Y Li, C Huang, CC Loy, X Tang Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016	204	2016
Rtmpose: Real-time multi-person pose estimation based on mmpose T Jiang, P Lu, L Zhang, N Ma, R Han, C Lyu, Y Li, K Chen arXiv preprint arXiv:2303.07399, 2023	172	2023
Internlm-xcomposer2-4khd: A pioneering large vision-language model handling resolutions from 336 pixels to 4k hd X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, S Zhang, H Duan, ... arXiv preprint arXiv:2404.06512, 2024	109	2024
Internlm-xcomposer-2.5: A versatile large vision language model supporting long-contextual input and output P Zhang, X Dong, Y Zang, Y Cao, R Qian, L Chen, Q Guo, H Duan, ... arXiv preprint arXiv:2407.03320, 2024	69	2024
OMG-Seg: Is one model good enough for all segmentation? X Li, H Yuan, W Li, H Ding, S Wu, W Zhang, Y Li, K Chen, CC Loy Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	44	2024
Open-vocabulary SAM: Segment and recognize twenty-thousand classes interactively H Yuan, X Li, C Zhou, Y Li, K Chen, CC Loy European Conference on Computer Vision, 419-437, 2024	33	2024
RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation P Lu, T Jiang, Y Li, X Li, K Chen, W Yang Proceedings of the IEEE conference on computer vision and pattern recognition, 2024	30	2024
MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding X Fang, K Mao, H Duan, X Zhao, Y Li, D Lin, K Chen arXiv preprint arXiv:2406.14515, 2024	29	2024
Learning to disambiguate by asking discriminative questions Y Li, C Huang, X Tang, C Change Loy Proceedings of the IEEE International Conference on Computer Vision, 3419-3428, 2017	29	2017
An open and comprehensive pipeline for unified object grounding and detection X Zhao, Y Chen, S Xu, X Li, X Wang, Y Li, H Huang arXiv preprint arXiv:2401.02361, 2024	23	2024
Towards language-driven video inpainting via multimodal large language models J Wu, X Li, C Si, S Zhou, J Yang, J Zhang, Y Li, K Chen, Y Tong, Z Liu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	21	2024
Motionbooth: Motion-aware customized text-to-video generation J Wu, X Li, Y Zeng, J Zhang, Q Zhou, Y Li, Y Tong, K Chen arXiv preprint arXiv:2406.17758, 2024	14	2024
Dst-det: Simple dynamic self-training for open-vocabulary object detection S Xu, X Li, S Wu, W Zhang, Y Li, G Cheng, Y Tong, K Chen, CC Loy arXiv preprint arXiv:2310.01393, 2023	11	2023
Mg-llava: Towards multi-granularity visual instruction tuning X Zhao, X Li, H Duan, H Huang, Y Li, K Chen, H Yang arXiv preprint arXiv:2406.17770, 2024	8	2024

A rendszer jelenleg nem tudja elvégezni a műveletet. Próbálkozzon újra később.

Cikkek 1–20

Hivatkozások évente

Ismétlődő hivatkozások

Összevont hivatkozások

Társszerzők hozzáadásaTársszerzők

Követés

Hivatkozott rá

Társszerzők