Jiannan Wu

Cited by

	All	Since 2020
Citations	1702	1702
H-index	11	11
i10-index	12	12

Citations per year
Year	2021	2022	2023	2024	2025
Citations	6	27	186	1173	308

Public access

9 articles available
1 article not available

Based on funding mandates

Co-authors

Ping Luo (羅平), Associate Professor, The University of Hong Kong; MMLAB@HKU. Verified email at hku.hk
Wenhai Wang (王文海), CUHK | Shanghai AI Laboratory | NJU. Verified email at cuhk.edu.hk
Yi Jiang, Bytedance. Verified email at bytedance.com
Zehuan Yuan, Bytedance Inc. Verified email at bytedance.com
Zhe Chen (陈喆), PhD candidate, Nanjing University. Verified email at smail.nju.edu.cn
Jifeng Dai, Associate Professor of EE, Tsinghua University; Adjunct Researcher of Shanghai AI Laboratory. Verified email at tsinghua.edu.cn
Bin Yan, PhD student of Computer Vision, Dalian University of Technology. Verified email at mail.dlut.edu.cn
Peize Sun, Meta FAIR; HKU. Verified email at meta.com


Jiannan Wu

The University of Hong Kong

Verified email at connect.hku.hk

Computer Vision, Video Understanding, Multimodal LLMs


Title	Cited by	Year
Internvl: Scaling up vision foundation models and aligning for generic visual-linguistic tasks; Z Chen, J Wu, W Wang, W Su, G Chen, S Xing, Z Muyan, Q Zhang, X Zhu, ...; IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024	683*	2024
Visionllm: Large language model is also an open-ended decoder for vision-centric tasks; W Wang, Z Chen, X Chen, J Wu, X Zhu, G Zeng, P Luo, T Lu, J Zhou, ...; Advances in Neural Information Processing Systems (NeurIPS), 2023	448	2023
Universal instance perception as object discovery and retrieval; B Yan, Y Jiang, J Wu, D Wang, P Luo, Z Yuan, H Lu; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	173	2023
Language as queries for referring video object segmentation; J Wu, Y Jiang, P Sun, Z Yuan, P Luo; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	171	2022
Watch only once: An end-to-end video action detection framework; S Chen, P Sun, E Xie, C Ge, J Wu, L Ma, J Shen, P Luo; Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	74	2021
Visionllm v2: An end-to-end generalist multimodal large language model for hundreds of vision-language tasks; J Wu, M Zhong, S Xing, Z Lai, Z Liu, Z Chen, W Wang, X Zhu, L Lu, T Lu, ...; Advances in Neural Information Processing Systems 37, 69925-69975, 2025	33	2025
Groma: Localized visual tokenization for grounding multimodal large language models; C Ma, Y Jiang, J Wu, Z Yuan, X Qi; European Conference on Computer Vision, 417-435, 2024	33	2024
Self-supervised video representation learning with motion-aware masked autoencoders; H Yang, D Huang, B Wen, J Wu, H Yao, Y Jiang, X Zhu, Z Yuan; arXiv preprint arXiv:2210.04154, 2022	20	2022
The first visual object tracking segmentation vots2023 challenge results; M Kristan, J Matas, M Danelljan, M Felsberg, HJ Chang, LČ Zajc, ...; Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	13	2023
Development of an effective model for computing rightmost eigenvalues of power systems with inclusion of time delays; C Li, J Wu, C Duan, Z Du; IEEE Transactions on Power Systems 34 (6), 4216-4227, 2019	13	2019
Segment every reference object in spatial and temporal spaces; J Wu, Y Jiang, B Yan, H Lu, Z Yuan, P Luo; Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	12	2023
Towards high-quality temporal action detection with sparse proposals; J Wu, P Sun, S Chen, J Yang, Z Qi, L Ma, P Luo; arXiv preprint arXiv:2109.08847, 2021	11	2021
Exploring transformers for open-world instance segmentation; J Wu, Y Jiang, B Yan, H Lu, Z Yuan, P Luo; Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	8	2023
Uniref++: Segment every reference object in spatial and temporal spaces; J Wu, Y Jiang, B Yan, H Lu, Z Yuan, P Luo; arXiv preprint arXiv:2312.15715, 2023	5	2023
Multi-level contrastive learning for dense prediction task; Q Guo, Y Yu, Y Jiang, J Wu, Z Yuan, P Luo; arXiv preprint arXiv:2304.02010, 2023	3	2023
A Simple Baseline for Open-World Tracking via Self-training; B Wang, T Li, J Wu, Y Jiang, H Lu, Y He; Proceedings of the 31st ACM International Conference on Multimedia, 2765-2774, 2023	2	2023
Method, apparatus, device, and medium for processing visual task by generic model; Y Jiang, B Yan, J Wu, Y Zehuan; US Patent App. 18/531,091, 2024		2024
Method, apparatus, device and medium for processing image using machine learning model; Y Jiang, J Wu, B Yan, Y Zehuan; US Patent App. 18/499,066, 2024		2024
MotionMAE: Self-supervised Video Representation Learning with Motion-Aware Masked Auto encoders; H Yang, D Huang, B Wen, J Wu, H Yao, Y Jiang, X Zhu, Z Yuan		2024
