Yang Jiao
Verified email at m.fudan.edu.cn
Title | Cited by | Year
MSMDFusion: Fusing LiDAR and camera at multiple scales with multi-depth seeds for 3D object detection
Y Jiao, Z Jie, S Chen, J Chen, L Ma, YG Jiang
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
Cited by: 106; Year: 2023
NuScenes-QA: A multi-modal visual question answering benchmark for autonomous driving scenario
T Qian, J Chen, L Zhuo, Y Jiao, YG Jiang
Proceedings of the AAAI Conference on Artificial Intelligence 38 (5), 4542-4550, 2024
Cited by: 104; Year: 2024
MORE: Multi-order relation mining for dense captioning in 3D scenes
Y Jiao, S Chen, Z Jie, J Chen, L Ma, YG Jiang
European Conference on Computer Vision, 528-545, 2022
Cited by: 43; Year: 2022
Two-stage visual cues enhancement network for referring image segmentation
Y Jiao, Z Jie, W Luo, J Chen, YG Jiang, X Wei, L Ma
Proceedings of the 29th ACM international conference on multimedia, 1331-1340, 2021
Cited by: 26; Year: 2021
Lumen: Unleashing versatile vision-centric capabilities of large multimodal models
Y Jiao, S Chen, Z Jie, J Chen, L Ma, YG Jiang
NeurIPS 2024, 2024
Cited by: 12; Year: 2024
Eyes can deceive: Benchmarking counterfactual reasoning abilities of multi-modal large language models
Y Li, W Tian, Y Jiao, J Chen, YG Jiang
arXiv e-prints, arXiv: 2404.12966, 2024
Cited by: 12; Year: 2024
From canteen food to daily meals: Generalizing food recognition to more practical scenarios
G Liu, Y Jiao, J Chen, B Zhu, YG Jiang
IEEE Transactions on Multimedia, 2024
Cited by: 10; Year: 2024
EventHallusion: Diagnosing event hallucinations in video LLMs
J Zhang, Y Jiao, S Chen, J Chen, YG Jiang
arXiv preprint arXiv:2409.16597, 2024
Cited by: 8; Year: 2024
Instance-aware multi-camera 3D object detection with structural priors mining and self-boosting learning
Y Jiao, Z Jie, S Chen, L Cheng, J Chen, L Ma, YG Jiang
Proceedings of the AAAI Conference on Artificial Intelligence 38 (3), 2598-2606, 2024
Cited by: 8; Year: 2024
Suspected Objects Matter: Rethinking Model's Prediction for One-stage Visual Grounding
Y Jiao, Z Jie, J Chen, L Ma, YG Jiang
Proceedings of the 31st ACM International Conference on Multimedia, 17-26, 2023
Cited by: 7; Year: 2023
Eagle: Towards efficient arbitrary referring visual prompts comprehension for multimodal large language models
J Zhang, Y Jiao, S Chen, J Chen, YG Jiang
arXiv preprint arXiv:2409.16723, 2024
Cited by: 1; Year: 2024