HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination & Visual Illusion in Large Vision-Language Models T Guan*, F Liu*, X Wu, R Xian, Z Li, X Liu, X Wang, L Chen, F Huang, ... IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024), 2024 | 280* | 2024 |
On the safety concerns of deploying llms/vlms in robotics: Highlighting the risks and vulnerabilities X Wu, R Xian, T Guan, J Liang, S Chakraborty, F Liu, BM Sadler, ... First Vision and Language for Autonomous Driving and Robotics Workshop, 2024 | 25* | 2024 |
Aztr: Aerial video action recognition with auto zoom and temporal reasoning R Xian*, X Wang*, T Guan, CM de Melo, SM Nogar, A Bera, D Manocha IEEE International Conference on Robotics and Automation (ICRA 2023), 2023 | 15 | 2023 |
PMI Sampler: Patch similarity guided frame selection for Aerial Action Recognition R Xian, X Wang, D Kothandaraman, D Manocha IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024), 2024 | 12 | 2024 |
Mitfas: Mutual information based temporal feature alignment and sampling for aerial video action recognition R Xian*, X Wang*, D Manocha IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024), 2024 | 12 | 2024 |
AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models X Wu, T Guan, D Li, S Huang, X Liu, X Wang, R Xian, A Shrivastava, ... 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024), 2024 | 7 | 2024 |
SCP: Soft Conditional Prompt Learning for Aerial Video Action Recognition X Wang, R Xian, T Guan, F Liu, D Manocha 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2024 | 1 | 2024 |
Real-time human action recognition from aerial videos using autozoom and synthetic data R Xian, BI Vogel, CM De Melo, AV Harrison, D Manocha SPIE Defense + Commercial Sensing conference (DCS 2024) 13035, 119-129, 2024 | 1 | 2024 |
AGL-NET: Aerial-Ground Cross-Modal Global Localization with Varying Scales R Xian*, T Guan*, X Wang, X Wu, M Elnoor, D Song, D Manocha IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024), 2024 | 1 | 2024 |
PLAR: Prompt Learning for Action Recognition R Xian*, X Wang*, T Guan, D Manocha IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024), 2023 | 1* | 2023 |
DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments X Wang, P Sandoval-Segura, C Zhang, J Huang, T Guan, R Xian, F Liu, ... arXiv preprint arXiv:2412.20042, 2024 | | 2024 |
Robot Navigation Using Physically Grounded Vision-Language Models in Outdoor Environments M Elnoor, K Weerakoon, G Seneviratne, R Xian, T Guan, MKM Jaffar, ... arXiv preprint arXiv:2409.20445, 2024 | | 2024 |
SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining R Xian, X Wu, T Guan, X Wang, B Gong, D Manocha arXiv preprint arXiv:2409.18300, 2024 | | 2024 |
IndianRoad: A Video Dataset of Diverse Atomic Visual Elements in Dense and Unpredictable Environments X Wang, P Sandoval-Segura, T Guan, R Xian, F Liu, R Chandra, B Gong, ... | | |