Shu Zhang

Cited by

	All	Since 2020
Citations	1078	952
h-index	10	10
i10-index	11	10

Citations per year
Year	Citations
2014	3
2015	12
2016	15
2017	26
2018	28
2019	39
2020	56
2021	86
2022	117
2023	191
2024	431
2025	71

Public access (based on funding mandates)
Available	2 articles
Not available	0 articles

Co-authors

Ran Xu	Salesforce Research	Verified email at salesforce.com
Caiming Xiong	Salesforce Research	Verified email at salesforce.com
Ning Yu	Netflix Eyeline Studios	Verified email at scanlinevfx.com
Amit K. Roy-Chowdhury	Professor and UC Presidential Chair, UC Riverside; Fellow IEEE, IAPR	Verified email at ece.ucr.edu
Can Qin	Salesforce	Verified email at salesforce.com
Yihao Feng	Apple AIML	Verified email at apple.com
Juan Carlos Niebles	Research Director (Salesforce) & Adjunct Professor (Stanford University)	Verified email at cs.stanford.edu
Yingying Zhu	Google Inc.	Verified email at ieee.org
Chenyou Fan	South China Normal University, Indiana University Bloomington	Verified email at m.scnu.edu.cn
Qi Zhu	Professor of Computer Engineering	Verified email at northwestern.edu
Abir Das	Assistant Professor at IIT Kharagpur	Verified email at cse.iitkgp.ac.in

Shu Zhang

Salesforce Inc.

Verified email at salesforce.com

computer vision, image generation, 3D understanding, multi-modal


Title	Cited by	Year
Heterogeneous memory enhanced multimodal attention model for video question answering; C Fan, X Zhang, S Zhang, W Wang, C Zhang, H Huang; Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019	332	2019
UniControl: A unified diffusion model for controllable visual generation in the wild; C Qin, S Zhang, N Yu, Y Feng, X Yang, Y Zhou, H Wang, JC Niebles, ...; arXiv preprint arXiv:2305.11147, 2023	108	2023
ULIP-2: Towards scalable multimodal pre-training for 3D understanding; L Xue, N Yu, S Zhang, A Panagopoulou, J Li, R Martín-Martín, J Wu, ...; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	107	2024
Context-aware surveillance video summarization; S Zhang, Y Zhu, AK Roy-Chowdhury; IEEE Transactions on Image Processing 25 (11), 5469-5478, 2016	105	2016
HIVE: Harnessing human feedback for instructional visual editing; S Zhang, X Yang, Y Feng, C Qin, CC Chen, N Yu, Z Chen, H Wang, ...; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	100	2024
Use all the labels: A hierarchical multi-label contrastive learning framework; S Zhang, R Xu, C Xiong, C Ramaiah; Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022	95	2022
A camera network tracking (camnet) dataset and performance baseline; S Zhang, E Staudt, T Faltemier, AK Roy-Chowdhury; 2015 IEEE winter conference on applications of computer vision, 365-372, 2015	82	2015
Tracking multiple interacting targets in a camera network; S Zhang, Y Zhu, A Roy-Chowdhury; Computer Vision and Image Understanding 134, 64-73, 2015	50	2015
xGen-MM (BLIP-3): A family of open large multimodal models; L Xue, M Shu, A Awadalla, J Wang, A Yan, S Purushwalkam, H Zhou, ...; arXiv preprint arXiv:2408.08872, 2024	49	2024
GlueGen: Plug and play multi-modal encoders for x-to-image generation; C Qin, N Yu, C Xing, S Zhang, Z Chen, S Ermon, Y Fu, C Xiong, R Xu; Proceedings of the IEEE/CVF international conference on computer vision …, 2023	23	2023
Video summarization through change detection in a non-overlapping camera network; S Zhang, AK Roy-Chowdhury; 2015 IEEE International Conference on Image Processing (ICIP), 3832-3836, 2015	10	2015
Online social behavior modeling for multi-target tracking; S Zhang, A Das, C Ding, A Roy-Chowdhury; Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2013	8	2013
Adaptive algorithm selection, with applications in pedestrian detection; S Zhang, Q Zhu, A Roy-Chowdhury; 2016 IEEE International Conference on Image Processing (ICIP), 3768-3772, 2016	5	2016
Systems and methods for vision-language distribution alignment; S Zhang, LI Junnan, R Xu, C Xiong, C Ramaiah; US Patent 12,112,523, 2024	2	2024
Template-based key-value extraction for inferring OCR key values within form images; S Zhang, C Ramaiah, R Xu, C Xiong; US Patent 11,495,011, 2022	1	2022
Adaptive algorithm and platform selection for visual detection and tracking; S Zhang, Q Zhu, A Roy-Chowdhury; arXiv preprint arXiv:1605.06597, 2016	1	2016
Systems and methods for controllable image generation; N Yu, C Qin, S Zhang, Y Feng, X Yang, R Xu; US Patent App. 18/477,764, 2024		2024
Systems and methods for multimodal pretraining for three-dimensional understanding models; L Xue, N Yu, S Zhang, LI Junnan, C Xiong, S Savarese, JCN Duque, R Xu; US Patent App. 18/493,035, 2024		2024
Systems and methods for feedback based instructional visual editing; S Zhang, X Yang, Y Feng, R Xu, N Yu, CC Chen; US Patent App. 18/350,876, 2024		2024
Systems and methods for text-to-image generation using language models; N Yu, C Qin, C Xing, S Zhang, S Ermon, C Xiong, R Xu; US Patent App. 18/162,535, 2024		2024
