עקוב אחר
Jiannan Wu
Jiannan Wu
כתובת אימייל מאומתת בדומיין connect.hku.hk - דף הבית
כותרת
צוטט על ידי
צוטט על ידי
שנה
Internvl: Scaling up vision foundation models and aligning for generic visual-linguistic tasks
Z Chen, J Wu, W Wang, W Su, G Chen, S Xing, Z Muyan, Q Zhang, X Zhu, ...
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
683*2024
Visionllm: Large language model is also an open-ended decoder for vision-centric tasks
W Wang, Z Chen, X Chen, J Wu, X Zhu, G Zeng, P Luo, T Lu, J Zhou, ...
Advances in Neural Information Processing Systems (NeurIPS), 2023
4482023
Universal instance perception as object discovery and retrieval
B Yan, Y Jiang, J Wu, D Wang, P Luo, Z Yuan, H Lu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
1732023
Language as queries for referring video object segmentation
J Wu, Y Jiang, P Sun, Z Yuan, P Luo
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
1712022
Watch only once: An end-to-end video action detection framework
S Chen, P Sun, E Xie, C Ge, J Wu, L Ma, J Shen, P Luo
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
742021
Visionllm v2: An end-to-end generalist multimodal large language model for hundreds of vision-language tasks
J Wu, M Zhong, S Xing, Z Lai, Z Liu, Z Chen, W Wang, X Zhu, L Lu, T Lu, ...
Advances in Neural Information Processing Systems 37, 69925-69975, 2025
332025
Groma: Localized visual tokenization for grounding multimodal large language models
C Ma, Y Jiang, J Wu, Z Yuan, X Qi
European Conference on Computer Vision, 417-435, 2024
332024
Self-supervised video representation learning with motion-aware masked autoencoders
H Yang, D Huang, B Wen, J Wu, H Yao, Y Jiang, X Zhu, Z Yuan
arXiv preprint arXiv:2210.04154, 2022
202022
The first visual object tracking segmentation vots2023 challenge results
M Kristan, J Matas, M Danelljan, M Felsberg, HJ Chang, LČ Zajc, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
132023
Development of an effective model for computing rightmost eigenvalues of power systems with inclusion of time delays
C Li, J Wu, C Duan, Z Du
IEEE Transactions on Power Systems 34 (6), 4216-4227, 2019
132019
Segment every reference object in spatial and temporal spaces
J Wu, Y Jiang, B Yan, H Lu, Z Yuan, P Luo
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
122023
Towards high-quality temporal action detection with sparse proposals
J Wu, P Sun, S Chen, J Yang, Z Qi, L Ma, P Luo
arXiv preprint arXiv:2109.08847, 2021
112021
Exploring transformers for open-world instance segmentation
J Wu, Y Jiang, B Yan, H Lu, Z Yuan, P Luo
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
82023
Uniref++: Segment every reference object in spatial and temporal spaces
J Wu, Y Jiang, B Yan, H Lu, Z Yuan, P Luo
arXiv preprint arXiv:2312.15715, 2023
52023
Multi-level contrastive learning for dense prediction task
Q Guo, Y Yu, Y Jiang, J Wu, Z Yuan, P Luo
arXiv preprint arXiv:2304.02010, 2023
32023
A Simple Baseline for Open-World Tracking via Self-training
B Wang, T Li, J Wu, Y Jiang, H Lu, Y He
Proceedings of the 31st ACM International Conference on Multimedia, 2765-2774, 2023
22023
Method, apparatus, device, and medium for processing visual task by generic model
Y Jiang, B Yan, J Wu, Y Zehuan
US Patent App. 18/531,091, 2024
2024
Method, apparatus, device and medium for processing image using machine learning model
Y Jiang, J Wu, B Yan, Y Zehuan
US Patent App. 18/499,066, 2024
2024
MotionMAE: Self-supervised Video Representation Learning with Motion-Aware Masked Auto encoders
H Yang, D Huang, B Wen, J Wu, H Yao, Y Jiang, X Zhu, Z Yuan
2024
המערכת אינה יכולה לבצע את הפעולה כעת. נסה שוב מאוחר יותר.
מאמרים 1–19