Jitesh Jain
Title
Cited by
Year
OneFormer: One Transformer to Rule Universal Image Segmentation
J Jain, J Li*, MT Chiu*, A Hassani, N Orlov, H Shi
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
Cited by: 390; Year: 2023
SeMask: Semantically Masked Transformers for Semantic Segmentation
J Jain, A Singh, N Orlov, Z Huang, J Li, S Walton, H Shi
IEEE International Conference on Computer Vision (ICCV) Workshops, 2023
Cited by: 138; Year: 2023
Keys to Better Image Inpainting: Structure and Texture Go Hand in Hand
J Jain*, Y Zhou*, N Yu, H Shi
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2023
Cited by: 69; Year: 2023
VCoder: Versatile Vision Encoders for Multimodal Large Language Models
J Jain, J Yang, H Shi
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
Cited by: 37; Year: 2024
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
J Li, X Wang, S Zhu, CW Kuo, L Xu, F Chen, J Jain, H Shi, L Wen
Advances in Neural Information Processing Systems (NeurIPS), 2024
Cited by: 26; Year: 2024
Matting Anything
J Li, J Jain, H Shi
IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops …, 2024
Cited by: 21; Year: 2024
Benchmarking Object Detectors with COCO: A New Path Forward
S Singh*, A Yadav*, J Jain, H Shi, J Johnson, K Desai
European Conference on Computer Vision (ECCV), 2024
Cited by: 2; Year: 2024
Towards Responsible Use of Large Multi-modal AI to Analyze Human Social Behaviors
Q Zheng, X Lu, Q Jin, J Jain, H Meadan-Kaplansky, H Shi, J Xiong, ...
Companion Publication of the 2024 Conference on Computer-Supported …, 2024
Cited by: 1; Year: 2024
DEAP Cache: Deep Eviction Admission and Prefetching for Cache
A Mangal*, J Jain*, KK Guliani*, O Bhalerao*
arXiv preprint arXiv:2009.09206, 2020
Cited by: 1; Year: 2020
OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation
J Jain, Z Yang, H Shi, J Gao, J Yang
arXiv preprint arXiv:2412.09585, 2024
Year: 2024