Theo dõi
Shusheng Yang
Shusheng Yang
PhD student @ NYU Courant
Email được xác minh tại nyu.edu
Tiêu đề
Trích dẫn bởi
Trích dẫn bởi
Năm
Qwen technical report
J Bai, S Bai, Y Chu, Z Cui, K Dang, X Deng, Y Fan, W Ge, Y Han, F Huang, ...
Tech Report, 2023
25122023
Qwen-vl: A frontier large vision-language model with versatile abilities
J Bai*, S Bai*, S Yang*, S Wang, S Tan, P Wang, J Lin, C Zhou, J Zhou
Tech Report, 2023
11022023
Instances as queries
Y Fang*, S Yang*, X Wang, Y Li, C Fang, Y Shan, B Feng, W Liu
ICCV 2021, 2021
3602021
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs
S Tong, E Brown, P Wu, S Woo, M Middepogu, SC Akula, J Yang, S Yang, ...
NeurIPS 2024 Oral, 2024
2072024
Crossover learning for fast online video instance segmentation
S Yang*, Y Fang*, X Wang, Y Li, C Fang, Y Shan, B Feng, W Liu
ICCV 2021, 2021
1322021
Temporally Efficient Vision Transformer for Video Instance Segmentation
S Yang, X Wang, Y Li, Y Fang, J Fang, W Liu, X Zhao, Y Shan
CVPR 2022 Oral, 2022
822022
Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection
Y Fang*, S Yang*, S Wang*, Y Ge, Y Shan, X Wang
ICCV 2023, 2022
672022
ViTMatte: Boosting image matting with pre-trained plain vision transformers
J Yao, X Wang, S Yang, B Wang
Information Fusion 2024, 2024
592024
Masked Image Modeling with Denoising Contrast
K Yi, Y Ge, X Li, S Yang, D Li, J Wu, Y Shan, X Qie
ICLR 2023, 2022
532022
Touchstone: Evaluating vision-language models by language models
S Bai, S Yang, J Bai, P Wang, X Zhang, J Lin, X Wang, C Zhou, J Zhou
ArXiv 2023, 2023
442023
Tracking Instances as Queries
S Yang*, Y Fang*, X Wang, Y Li, Y Shan, B Feng, W Liu
CVPRW 2021, 2021
142021
RILS: Masked Visual Reconstruction in Language Semantic Space
S Yang, Y Ge, K Yi, D Li, Y Shan, X Qie, X Wang
CVPR 2023, 2023
10*2023
MobileInst: Video Instance Segmentation on the Mobile
R Zhang*, T Cheng*, S Yang, H Jiang, S Zhang, J Lyu, X Li, X Ying, ...
AAAI 2024, 2024
82024
Relational Surrogate Loss Learning
T Huang, Z Li, H Lu, Y Shan, S Yang, Y Feng, F Wang, S You, C Xu
ICLR 2022, 2022
82022
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces
J yang*, S Yang*, A Gupta*, R Han*, L Fei-Fei, S Xie
42024
Hệ thống không thể thực hiện thao tác ngay bây giờ. Hãy thử lại sau.
Bài viết 1–15