Nuscenes-qa: A multi-modal visual question answering benchmark for autonomous driving scenario T Qian, J Chen, L Zhuo, Y Jiao, YG Jiang Proceedings of the AAAI Conference on Artificial Intelligence 38 (5), 4542-4550, 2024 | 104 | 2024 |
Scene graph refinement network for visual question answering T Qian, J Chen, S Chen, B Wu, YG Jiang IEEE Transactions on Multimedia 25, 3950-3961, 2022 | 46 | 2022 |
Video moment retrieval from text queries via single frame annotation R Cui, T Qian, P Peng, E Daskalaki, J Chen, X Guo, H Sun, YG Jiang Proceedings of the 45th International ACM SIGIR Conference on Research and …, 2022 | 35 | 2022 |
Locate before answering: Answer guided question localization for video question answering T Qian, R Cui, J Chen, P Peng, X Guo, YG Jiang IEEE Transactions on Multimedia 26, 4554-4563, 2023 | 19 | 2023 |
Prompt as Free Lunch: Enhancing Diversity in Source-Free Cross-domain Few-shot Learning through Semantic-Guided Prompting L Zhuo, Z Wang, Y Fu, T Qian arXiv preprint arXiv:2412.00767, 2024 | 1 | 2024 |