Mixgen: A new multi-modal data augmentation X Hao, Y Zhu, S Appalaraju, A Zhang, W Zhang, B Li, M Li Proceedings of the IEEE/CVF winter conference on applications of computer …, 2023 | 109 | 2023 |
The robodrive challenge: Drive anytime anywhere in any condition L Kong, S Xie, H Hu, Y Niu, WT Ooi, BR Cottereau, LX Ng, Y Ma, W Zhang, ... ICRA 2024 Technical Report, 2024 | 22 | 2024 |
Dual alignment unsupervised domain adaptation for video-text retrieval X Hao, W Zhang, D Wu, F Zhu, B Li Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 21 | 2023 |
The end-of-end-to-end: A video understanding pentathlon challenge (2020) S Albanie, Y Liu, A Nagrani, A Miech, E Coto, I Laptev, R Sukthankar, ... arXiv preprint arXiv:2008.00744, 2020 | 15 | 2020 |
Is Your HD Map Constructor Reliable under Sensor Corruptions? X Hao, M Wei, Y Yang, H Zhao, H Zhang, Y Zhou, Q Wang, W Li, L Kong, ... NeurIPS 2024, 2024 | 12 | 2024 |
Listen and look: Multi-modal aggregation and co-attention network for video-audio retrieval X Hao, W Zhang, D Wu, F Zhu, B Li 2022 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2022 | 9 | 2022 |
Multi-feature graph attention network for cross-modal video-text retrieval X Hao, Y Zhou, D Wu, W Zhang, B Li, W Wang Proceedings of the 2021 international conference on multimedia retrieval …, 2021 | 9 | 2021 |
Uncertainty-aware alignment network for cross-domain video-text retrieval X Hao, W Zhang Advances in Neural Information Processing Systems 36, 2024 | 8 | 2024 |
What matters: Attentive and relational feature aggregation network for video-text retrieval X Hao, Y Zhou, D Wu, W Zhang, B Li, W Wang, D Meng 2021 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2021 | 8 | 2021 |
Mbfusion: A new multi-modal bev feature fusion method for hd map construction X Hao, H Zhang, Y Yang, Y Zhou, S Jung, SI Park, BI Yoo 2024 IEEE International Conference on Robotics and Automation (ICRA), 15922 …, 2024 | 7 | 2024 |
What foundation models can bring for robot learning in manipulation: A survey D Li, Y Jin, Y Sun, H Yu, J Shi, X Hao, P Hao, H Liu, F Sun, J Zhang, ... arXiv preprint arXiv:2404.18201, 2024 | 7 | 2024 |
MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation X Hao, R Li, H Zhang, D Li, R Yin, S Jung, SI Park, BI Yoo, H Zhao, ... ECCV 2024, 2024 | 5 | 2024 |
STViT+: improving self-supervised multi-camera depth estimation with spatial-temporal context and adversarial geometry regularization Z Chen, H Zhao, X Hao, B Yuan, X Li Applied Intelligence 55 (5), 328, 2025 | 1 | 2025 |
MSC-Bench: Benchmarking and Analyzing Multi-Sensor Corruption for Driving Perception X Hao, G Liu, Y Zhao, Y Ji, M Wei, H Zhao, L Kong, R Yin, Y Liu arXiv preprint arXiv:2501.01037, 2025 | 1 | 2025 |
TASAR: Transfer-based Attack on Skeletal Action Recognition Y Diao, B Wu, R Zhang, A Liu, X Hao, X Wei, M Wang, H Wang International Conference on Learning Representations (ICLR), 2024 | 1 | 2024 |
FTF-ER: Feature-Topology Fusion-Based Experience Replay Method for Continual Graph Learning J Pang, C Lin, X Hao, R Yin, Z Wang, Z Zhang, J He, HT Sheng ACM MM 2024, 2024 | 1 | 2024 |
Team Samsung-RAL: Technical Report for 2024 RoboDrive Challenge-Robust Map Segmentation Track X Hao, Y Yang, H Zhang, M Wei, Y Zhou, H Zhao, J Zhang arXiv preprint arXiv:2405.10567, 2024 | 1 | 2024 |
Customized Treatment Per Pixel for Blind Image Super-Resolution G Liu, X Hao ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
Bi-directional Hard-Negatives Ranking Loss for Cross-Modal Video-Text Retrieval X Hao | 1 | 2020 |
MapFusion: A Novel BEV Feature Fusion Network for Multi-modal Map Construction X Hao, Y Diao, M Wei, Y Yang, P Hao, R Yin, H Zhang, W Li, S Zhao, ... Information Fusion, 2025 | | 2025 |