Ruiqi Xian
Title
Cited by
Year
HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination & Visual Illusion in Large Vision-Language Models
T Guan*, F Liu*, X Wu, R Xian, Z Li, X Liu, X Wang, L Chen, F Huang, ...
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024), 2024
Cited by 280*; 2024
On the safety concerns of deploying llms/vlms in robotics: Highlighting the risks and vulnerabilities
X Wu, R Xian, T Guan, J Liang, S Chakraborty, F Liu, BM Sadler, ...
First Vision and Language for Autonomous Driving and Robotics Workshop, 2024
Cited by 25*; 2024
Aztr: Aerial video action recognition with auto zoom and temporal reasoning
R Xian*, X Wang*, T Guan, CM de Melo, SM Nogar, A Bera, D Manocha
IEEE International Conference on Robotics and Automation (ICRA 2023), 2023
Cited by 15; 2023
PMI Sampler: Patch similarity guided frame selection for Aerial Action Recognition
R Xian, X Wang, D Kothandaraman, D Manocha
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024), 2024
Cited by 12; 2024
Mitfas: Mutual information based temporal feature alignment and sampling for aerial video action recognition
R Xian*, X Wang*, D Manocha
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024), 2024
Cited by 12; 2024
AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
X Wu, T Guan, D Li, S Huang, X Liu, X Wang, R Xian, A Shrivastava, ...
2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024), 2024
Cited by 7; 2024
SCP: Soft Conditional Prompt Learning for Aerial Video Action Recognition
X Wang, R Xian, T Guan, F Liu, D Manocha
2024 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2024
Cited by 1; 2024
Real-time human action recognition from aerial videos using autozoom and synthetic data
R Xian, BI Vogel, CM De Melo, AV Harrison, D Manocha
SPIE Defense + Commercial Sensing conference (DCS 2024) 13035, 119-129, 2024
Cited by 1; 2024
AGL-NET: Aerial-Ground Cross-Modal Global Localization with Varying Scales
R Xian*, T Guan*, X Wang, X Wu, M Elnoor, D Song, D Manocha
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024), 2024
Cited by 1; 2024
PLAR: Prompt Learning for Action Recognition
R Xian*, X Wang*, T Guan, D Manocha
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024), 2023
Cited by 1*; 2023
DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments
X Wang, P Sandoval-Segura, C Zhang, J Huang, T Guan, R Xian, F Liu, ...
arXiv preprint arXiv:2412.20042, 2024
2024
Robot Navigation Using Physically Grounded Vision-Language Models in Outdoor Environments
M Elnoor, K Weerakoon, G Seneviratne, R Xian, T Guan, MKM Jaffar, ...
arXiv preprint arXiv:2409.20445, 2024
2024
SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining
R Xian, X Wu, T Guan, X Wang, B Gong, D Manocha
arXiv preprint arXiv:2409.18300, 2024
2024
IndianRoad: A Video Dataset of Diverse Atomic Visual Elements in Dense and Unpredictable Environments
X Wang, P Sandoval-Segura, T Guan, R Xian, F Liu, R Chandra, B Gong, ...
Articles 1–14