Qwen2. 5 technical report A Yang, B Yang, B Zhang, B Hui, B Zheng, B Yu, C Li, D Liu, F Huang, ... arXiv preprint arXiv:2412.15115, 2024 | 798 | 2024 |
Parsing-based view-aware embedding network for vehicle re-identification D Meng, L Li, X Liu, Y Li, S Yang, ZJ Zha, X Gao, S Wang, Q Huang Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 241 | 2020 |
Qwen2-vl: Enhancing vision-language model's perception of the world at any resolution P Wang, S Bai, S Tan, S Wang, Z Fan, J Bai, K Chen, X Liu, J Wang, W Ge, ... arXiv preprint arXiv:2409.12191, 2024 | 227 | 2024 |
Adaptive reconstruction network for weakly supervised referring expression grounding X Liu, L Li, S Wang, ZJ Zha, D Meng, Q Huang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019 | 103 | 2019 |
Knowledge-guided pairwise reconstruction network for weakly supervised referring expression grounding X Liu, L Li, S Wang, ZJ Zha, L Su, Q Huang Proceedings of the 27th ACM International Conference on Multimedia, 539-547, 2019 | 61 | 2019 |
Entity-enhanced adaptive reconstruction network for weakly supervised referring expression grounding X Liu, L Li, S Wang, ZJ Zha, Z Li, Q Tian, Q Huang IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (3), 3003-3018, 2022 | 43 | 2022 |
Context disentangling and prototype inheriting for robust visual grounding W Tang, L Li, X Liu, L Jin, J Tang, Z Li IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 | 20 | 2023 |
Deeply coupled cross-modal prompt learning X Liu, W Tang, J Lu, R Zhao, Z Guo, F Tan Findings of the Association for Computational Linguistics: ACL 2023, 7957-7970, 2023 | 17 | 2023 |
What Large Language Models Bring to Text-rich VQA? X Liu, W Tang, X Ni, J Lu, R Zhao, Z Li, F Tan arXiv preprint arXiv:2311.07306, 2023 | 10 | 2023 |
Viewpoint alignment and discriminative parts enhancement in 3d space for vehicle reid D Meng, L Li, X Liu, L Gao, Q Huang IEEE Transactions on Multimedia 25, 2954-2965, 2022 | 8 | 2022 |
Local-binarized very deep residual network for visual categorization X Liu, L Li, S Wang, ZJ Zha, Q Huang Neurocomputing 430, 82-93, 2021 | 7 | 2021 |
PaDeLLM-NER: parallel decoding in large language models for named entity recognition J Lu, Z Yang, Y Wang, X Liu, B Mac Namee, C Huang arXiv preprint arXiv:2402.04838, 2024 | 5 | 2024 |
Transferrable referring expression grounding with concept transfer and context inheritance X Liu, L Li, S Wang, ZJ Zha, D Meng, Q Huang Proceedings of the 28th ACM International Conference on Multimedia, 3938-3946, 2020 | 4 | 2020 |
SynthDoc: Bilingual Documents Synthesis for Visual Document Understanding C Ding, X Liu, W Tang, J Li, X Wang, R Zhao, CT Nguyen, F Tan Proceedings of the 2nd Workshop on Large Generative Models Meet Multimodal …, 2024 | | 2024 |