Følg
Yining Li
Yining Li
Shanghai AI Laboratory
Verifisert e-postadresse på pjlab.org.cn - Startside
Tittel
Sitert av
Sitert av
År
Learning deep representation for imbalanced classification
C Huang, Y Li, CC Loy, X Tang
Proceedings of the IEEE conference on computer vision and pattern …, 2016
13142016
Deep imbalanced learning for face recognition and attribute prediction
C Huang, Y Li, CC Loy, X Tang
IEEE transactions on pattern analysis and machine intelligence 42 (11), 2781 …, 2019
4072019
Openmmlab pose estimation toolbox and benchmark
MMP Contributors
3822020
Internlm2 technical report
Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ...
arXiv preprint arXiv:2403.17297, 2024
2462024
Internlm-xcomposer2: Mastering free-form text-image composition and comprehension in vision-language large model
X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ...
arXiv preprint arXiv:2401.16420, 2024
2322024
Dense intrinsic appearance flow for human pose transfer
Y Li, C Huang, CC Loy
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
2202019
Human attribute recognition by deep hierarchical contexts
Y Li, C Huang, CC Loy, X Tang
Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016
2072016
Rtmpose: Real-time multi-person pose estimation based on mmpose
T Jiang, P Lu, L Zhang, N Ma, R Han, C Lyu, Y Li, K Chen
arXiv preprint arXiv:2303.07399, 2023
1782023
Internlm-xcomposer2-4khd: A pioneering large vision-language model handling resolutions from 336 pixels to 4k hd
X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, S Zhang, H Duan, ...
Advances in Neural Information Processing Systems 37, 42566-42592, 2025
1112025
Internlm-xcomposer-2.5: A versatile large vision language model supporting long-contextual input and output
P Zhang, X Dong, Y Zang, Y Cao, R Qian, L Chen, Q Guo, H Duan, ...
arXiv preprint arXiv:2407.03320, 2024
812024
Omg-seg: Is one model good enough for all segmentation?
X Li, H Yuan, W Li, H Ding, S Wu, W Zhang, Y Li, K Chen, CC Loy
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2024
492024
Open-vocabulary SAM: Segment and recognize twenty-thousand classes interactively
H Yuan, X Li, C Zhou, Y Li, K Chen, CC Loy
European Conference on Computer Vision, 419-437, 2024
382024
RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation
P Lu, T Jiang, Y Li, X Li, K Chen, W Yang
Proceedings of the IEEE conference on computer vision and pattern recognition, 2024
332024
Mmbench-video: A long-form multi-shot benchmark for holistic video understanding
X Fang, K Mao, H Duan, X Zhao, Y Li, D Lin, K Chen
Advances in Neural Information Processing Systems 37, 89098-89124, 2025
322025
Learning to disambiguate by asking discriminative questions
Y Li, C Huang, X Tang, C Change Loy
Proceedings of the IEEE International Conference on Computer Vision, 3419-3428, 2017
302017
An open and comprehensive pipeline for unified object grounding and detection
X Zhao, Y Chen, S Xu, X Li, X Wang, Y Li, H Huang
arXiv preprint arXiv:2401.02361, 2024
252024
Towards language-driven video inpainting via multimodal large language models
J Wu, X Li, C Si, S Zhou, J Yang, J Zhang, Y Li, K Chen, Y Tong, Z Liu, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
202024
Motionbooth: Motion-aware customized text-to-video generation
J Wu, X Li, Y Zeng, J Zhang, Q Zhou, Y Li, Y Tong, K Chen
arXiv preprint arXiv:2406.17758, 2024
182024
Dst-det: Simple dynamic self-training for open-vocabulary object detection
S Xu, X Li, S Wu, W Zhang, Y Tong, CC Loy
arXiv preprint arXiv:2310.01393, 2023
112023
Mg-llava: Towards multi-granularity visual instruction tuning
X Zhao, X Li, H Duan, H Huang, Y Li, K Chen, H Yang
arXiv preprint arXiv:2406.17770, 2024
82024
Systemet kan ikke utføre handlingen. Prøv på nytt senere.
Artikler 1–20