Shusheng Yang
Other names: 杨澍生
PhD student @ NYU Courant
Verified email at nyu.edu
Title
Cited by
Year
Qwen technical report
J Bai, S Bai, Y Chu, Z Cui, K Dang, X Deng, Y Fan, W Ge, Y Han, F Huang, ...
Tech Report, 2023
Cited by: 1882; Year: 2023
Qwen-vl: A frontier large vision-language model with versatile abilities
J Bai*, S Bai*, S Yang*, S Wang, S Tan, P Wang, J Lin, C Zhou, J Zhou
Tech Report, 2023
Cited by: 1557*; Year: 2023
Instances as queries
Y Fang*, S Yang*, X Wang, Y Li, C Fang, Y Shan, B Feng, W Liu
ICCV 2021, 2021
Cited by: 353; Year: 2021
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs
S Tong, E Brown, P Wu, S Woo, M Middepogu, SC Akula, J Yang, S Yang, ...
NeurIPS 2024 Oral, 2024
Cited by: 172; Year: 2024
Crossover learning for fast online video instance segmentation
S Yang*, Y Fang*, X Wang, Y Li, C Fang, Y Shan, B Feng, W Liu
ICCV 2021, 2021
Cited by: 132; Year: 2021
Temporally Efficient Vision Transformer for Video Instance Segmentation
S Yang, X Wang, Y Li, Y Fang, J Fang, W Liu, X Zhao, Y Shan
CVPR 2022 Oral, 2022
Cited by: 82; Year: 2022
Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection
Y Fang*, S Yang*, S Wang*, Y Ge, Y Shan, X Wang
ICCV 2023, 2022
Cited by: 68; Year: 2022
ViTMatte: Boosting image matting with pre-trained plain vision transformers
J Yao, X Wang, S Yang, B Wang
Information Fusion 2024, 2024
Cited by: 54; Year: 2024
Masked Image Modeling with Denoising Contrast
K Yi, Y Ge, X Li, S Yang, D Li, J Wu, Y Shan, X Qie
ICLR 2023, 2022
Cited by: 50; Year: 2022
Touchstone: Evaluating vision-language models by language models
S Bai, S Yang, J Bai, P Wang, X Zhang, J Lin, X Wang, C Zhou, J Zhou
ArXiv 2023, 2023
Cited by: 43; Year: 2023
Tracking Instances as Queries
S Yang*, Y Fang*, X Wang, Y Li, Y Shan, B Feng, W Liu
CVPRW 2021, 2021
Cited by: 14; Year: 2021
RILS: Masked Visual Reconstruction in Language Semantic Space
S Yang, Y Ge, K Yi, D Li, Y Shan, X Qie, X Wang
CVPR 2023, 2023
Cited by: 10*; Year: 2023
Relational Surrogate Loss Learning
T Huang, Z Li, H Lu, Y Shan, S Yang, Y Feng, F Wang, S You, C Xu
ICLR 2022, 2022
Cited by: 8; Year: 2022
MobileInst: Video Instance Segmentation on the Mobile
R Zhang*, T Cheng*, S Yang, H Jiang, S Zhang, J Lyu, X Li, X Ying, ...
AAAI 2024, 2024
Cited by: 7; Year: 2024
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces
J Yang*, S Yang*, AW Gupta*, R Han*, L Fei-Fei, S Xie
Cited by: 2; Year: 2024