Stebėti
Yuhang Zang
Yuhang Zang
Shanghai AI Laboratory
Patvirtintas el. paštas pjlab.org.cn - Pagrindinis puslapis
Pavadinimas
Cituota
Cituota
Metai
Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network
W Wang, E Xie, X Song, Y Zang, W Wang, T Lu, G Yu, C Shen
IEEE International Conference on Computer Vision (ICCV), 2019
6412019
Seesaw Loss for Long-Tailed Instance Segmentation
J Wang, W Zhang, Y Zang, Y Cao, J Pang, T Gong, K Chen, Z Liu, CC Loy, ...
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021
3082021
VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models
H Duan, J Yang, Y Qiao, X Fang, L Chen, Y Liu, X Dong, Y Zang, P Zhang, ...
ACM Multimedia (ACM MM) Open Source Software Competition, 2024
269*2024
Scene Text Detection with Supervised Pyramid Context Network
E Xie, Y Zang, S Shao, G Yu, C Yao, G Li
AAAI Conference on Artificial Intelligence (AAAI), 2019
2642019
InternLM2 Technical Report
Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ...
arXiv preprint arXiv:2403.17297, 2024
2552024
InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model
X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ...
arXiv preprint arXiv:2401.16420, 2024
2362024
Open-Vocabulary DETR with Conditional Matching
Y Zang, W Li, K Zhou, C Huang, CC Loy
European Conference on Computer Vision (ECCV) Oral, 2022
2192022
Are We on the Right Way for Evaluating Large Vision-Language Models?
L Chen, J Li, X Dong, P Zhang, Y Zang, Z Chen, H Duan, J Wang, Y Qiao, ...
Neural Information Processing Systems (NeurIPS), 2024
1792024
Unified Vision and Language Prompt Learning
Y Zang, W Li, K Zhou, C Huang, CC Loy
arXiv preprint arXiv:2210.07225, 2022
1692022
FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation
Y Zang, C Huang, CC Loy
IEEE International Conference on Computer Vision (ICCV), 2021
1372021
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD
X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, S Zhang, H Duan, ...
Neural Information Processing Systems (NeurIPS), 2024
1182024
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
L Chen, X Wei, J Li, X Dong, P Zhang, Y Zang, Z Chen, H Duan, B Lin, ...
Neural Information Processing Systems (NeurIPS), Datasets and Benchmarks Track, 2024
1042024
Long-CLIP: Unlocking the Long-Text Capability of CLIP
B Zhang, P Zhang, X Dong, Y Zang, J Wang
European Conference on Computer Vision (ECCV), 2024
892024
Contextual Object Detection with Multimodal Large Language Models
Y Zang, W Li, J Han, K Zhou, CC Loy
International Journal of Computer Vision (IJCV), 2023
872023
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
P Zhang, X Dong, Y Zang, Y Cao, R Qian, L Chen, Q Guo, H Duan, ...
arXiv preprint arXiv:2407.03320, 2024
832024
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Z Sun, Y Fang, T Wu, P Zhang, Y Zang, S Kong, Y Xiong, D Lin, J Wang
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
562024
Mvsgaussian: Fast generalizable gaussian splatting reconstruction from multi-view stereo
T Liu, G Wang, S Hu, L Shen, X Ye, Y Zang, Z Cao, W Li, Z Liu
European Conference on Computer Vision, 37-53, 2024
30*2024
MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs
Z Liu, T Chu, Y Zang, X Wei, X Dong, P Zhang, Z Liang, Y Xiong, Y Qiao, ...
Neural Information Processing Systems (NeurIPS), Datasets and Benchmarks Track, 2024
302024
Streaming Long Video Understanding with Large Language Models
R Qian, X Dong, P Zhang, Y Zang, S Ding, D Lin, J Wang
Neural Information Processing Systems (NeurIPS), 2024
232024
MotionClone: Training-Free Motion Cloning for Controllable Video Generation
P Ling, J Bu, P Zhang, X Dong, Y Zang, T Wu, H Chen, J Wang, Y Jin
arXiv preprint arXiv:2406.05338, 2024
232024
Sistema negali atlikti operacijos. Bandykite vėliau dar kartą.
Straipsniai 1–20