Seuraa
Shijia Yang
Shijia Yang
Vahvistettu sähköpostiosoite verkkotunnuksessa stanford.edu - Kotisivu
Nimike
Viittaukset
Viittaukset
Vuosi
Time will tell: New outlooks and a baseline for temporal multi-view 3d object detection
J Park, C Xu, S Yang, K Keutzer, K Kitani, M Tomizuka, W Zhan
International Conference on Learning Representations (ICLR), 2023
1742023
Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models
C Xu, S Yang, T Galanti, B Wu, X Yue, B Zhai, W Zhan, P Vajda, K Keutzer, ...
Proceedings of the European Conference on Computer Vision (ECCV), 2022, 2022
862022
Halle-switch: Controlling object hallucination in large vision language models
B Zhai, S Yang, C Xu, S Shen, K Keutzer, M Li
arXiv e-prints, pp. arXiv–2310, 2023
55*2023
Multitask vision-language prompt tuning
S Shen, S Yang, T Zhang, B Zhai, JE Gonzalez, K Keutzer, T Darrell
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024
532024
AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?
K Gong, K Feng, B Li, Y Wang, M Cheng, S Yang, J Han, B Wang, Y Bai, ...
arXiv preprint arXiv:2412.02611, 2024
22024
Law of vision representation in mllms
S Yang, B Zhai, Q You, J Yuan, H Yang, C Xu
arXiv preprint arXiv:2408.16357, 2024
22024
HallE-Control: controlling object hallucination in large multimodal models
B Zhai, S Yang, C Xu, S Shen, K Keutzer, C Li, M Li
arXiv preprint arXiv:2310.01779, 2023
22023
Järjestelmä ei voi suorittaa toimenpidettä nyt. Yritä myöhemmin uudelleen.
Artikkelit 1–7