DetCLIP: Dictionary-enriched visual-concept paralleled pre-training for open-world detection L Yao, J Han, Y Wen, X Liang, D Xu, W Zhang, Z Li, C Xu, H Xu Advances in Neural Information Processing Systems 35, 9125-9138, 2022 | 153 | 2022 |
SODA10M: A Large-Scale 2D Self/Semi-Supervised Object Detection Dataset for Autonomous Driving J Han, X Liang, H Xu, K Chen, L Hong, J Mao, C Ye, W Zhang, Z Li, ... arXiv preprint arXiv:2106.11118, 2021 | 116* | 2021 |
CODA: A real-world road corner case dataset for object detection in autonomous driving K Li, K Chen, H Wang, L Hong, C Ye, J Han, Y Chen, W Zhang, C Xu, ... European Conference on Computer Vision, 406-423, 2022 | 102 | 2022 |
CLIP2: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud Data Y Zeng, C Jiang, J Mao, J Han, C Ye, Q Huang, DY Yeung, Z Yang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 89 | 2023 |
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment L Yao, J Han, X Liang, D Xu, W Zhang, Z Li, H Xu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 82 | 2023 |
Open-world semantic segmentation via contrasting and clustering vision-language embedding Q Liu, Y Wen, J Han, C Xu, H Xu, X Liang European Conference on Computer Vision, 275-292, 2022 | 79 | 2022 |
DetGPT: Detect what you need via reasoning R Pi, J Gao, S Diao, R Pan, H Dong, J Zhang, L Yao, J Han, H Xu, L Kong, ... arXiv preprint arXiv:2305.14167, 2023 | 78 | 2023 |
ONCE-3DLanes: Building Monocular 3D Lane Detection F Yan, M Nie, X Cai, J Han, H Xu, Z Yang, C Ye, Y Fu, MB Mi, L Zhang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 69 | 2022 |
Aggregating crowd wisdoms with label-aware autoencoders L Yin, J Han, W Zhang, Y Yu Proceedings of the 26th International Joint Conference on Artificial …, 2017 | 66 | 2017 |
HiLM-D: Towards High-Resolution Understanding in Multimodal Large Language Models for Autonomous Driving X Ding, J Han, H Xu, W Zhang, X Li arXiv preprint arXiv:2309.05186, 2023 | 64 | 2023 |
Laneformer: Object-aware Row-Column Transformers for Lane Detection J Han, X Deng, X Cai, Z Yang, H Xu, C Xu, X Liang Proceedings of the AAAI Conference on Artificial Intelligence 36 (1), 799-807, 2022 | 54 | 2022 |
Effective adaptation in multi-task co-training for unified autonomous driving X Liang, Y Wu, J Han, H Xu, C Xu, X Liang Advances in Neural Information Processing Systems 35, 19645-19658, 2022 | 39 | 2022 |
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining Y Long, Y Wen, J Han, H Xu, P Ren, W Zhang, S Zhao, X Liang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 34 | 2023 |
NLIP: Noise-robust language-image pre-training R Huang, Y Long, J Han, H Xu, X Liang, C Xu, X Liang Proceedings of the AAAI Conference on Artificial Intelligence 37 (1), 926-934, 2023 | 32 | 2023 |
Reason2Drive: Towards interpretable and chain-based reasoning for autonomous driving M Nie, R Peng, C Wang, X Cai, J Han, H Xu, L Zhang European Conference on Computer Vision, 292-308, 2024 | 30 | 2024 |
Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models X Ding, J Han, H Xu, X Liang, W Zhang, X Li Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 30 | 2024 |
G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model J Gao, R Pi, J Zhang, J Ye, W Zhong, Y Wang, L Hong, J Han, H Xu, Z Li, ... arXiv preprint arXiv:2312.11370, 2023 | 24 | 2023 |
Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis K Chen, C Wang, K Yang, J Han, L Hong, F Mi, H Xu, Z Liu, W Huang, Z Li, ... arXiv preprint arXiv:2310.10477, 2023 | 24 | 2023 |
Task-Customized Self-Supervised Pre-training with Scalable Dynamic Routing Z Liu, J Han, L Hong, H Xu, K Chen, C Xu, Z Li Proceedings of the AAAI Conference on Artificial Intelligence 36 (2), 1854-1862, 2022 | 24 | 2022 |
Generative negative text replay for continual vision-language pretraining S Yan, L Hong, H Xu, J Han, T Tuytelaars, Z Li, X He European Conference on Computer Vision, 22-38, 2022 | 22 | 2022 |