Dreamllm: Synergistic multimodal comprehension and creation R Dong, C Han, Y Peng, Z Qi, Z Ge, J Yang, L Zhao, J Sun, H Zhou, H Wei, ... arXiv preprint arXiv:2309.11499, 2023 | 112 | 2023 |
Learning to learn adaptive classifier–predictor for few-shot learning N Lai, M Kan, C Han, X Song, S Shan IEEE transactions on neural networks and learning systems 32 (8), 3458-3470, 2020 | 109 | 2020 |
Vary: Scaling up the vision vocabulary for large vision-language model H Wei, L Kong, J Chen, L Zhao, Z Ge, J Yang, J Sun, C Han, X Zhang European Conference on Computer Vision, 408-424, 2024 | 72 | 2024 |
Exploring recurrent long-term temporal fusion for multi-view 3d perception C Han, J Yang, J Sun, Z Ge, R Dong, H Zhou, W Mao, Y Peng, X Zhang IEEE Robotics and Automation Letters, 2024 | 58 | 2024 |
Face recognition with contrastive convolution C Han, S Shan, M Kan, S Wu, X Chen Proceedings of the European Conference on Computer Vision (ECCV), 118-134, 2018 | 57 | 2018 |
Chatspot: Bootstrapping multimodal llms via precise referring instruction tuning L Zhao, E Yu, Z Ge, J Yang, H Wei, H Zhou, J Sun, Y Peng, R Dong, ... arXiv preprint arXiv:2307.09474, 2023 | 49 | 2023 |
Shapellm: Universal 3d object understanding for embodied interaction Z Qi, R Dong, S Zhang, H Geng, C Han, Z Ge, L Yi, K Ma European Conference on Computer Vision, 214-238, 2024 | 40 | 2024 |
Small language model meets with reinforced vision vocabulary H Wei, L Kong, J Chen, L Zhao, Z Ge, E Yu, J Sun, C Han, X Zhang arXiv preprint arXiv:2401.12503, 2024 | 30 | 2024 |
Xiangwen Kong, Xiangyu Zhang, Kaisheng Ma, and Li Yi. Dreamllm: Synergistic multimodal comprehension and creation R Dong, C Han, Y Peng, Z Qi, Z Ge, J Yang, L Zhao, J Sun, H Zhou, H Wei arXiv preprint arXiv:2309.11499 3, 2023 | 30 | 2023 |
General ocr theory: Towards ocr-2.0 via a unified end-to-end model H Wei, C Liu, J Chen, J Wang, L Kong, Y Xu, Z Ge, L Zhao, J Sun, Y Peng, ... | 18 | 2024 |
Focus Anywhere for Fine-grained Multi-page Document Understanding C Liu, H Wei, J Chen, L Kong, Z Ge, Z Zhu, L Zhao, J Sun, C Han, ... arXiv preprint arXiv:2405.14295, 2024 | 14 | 2024 |
Personalized convolution for face recognition C Han, S Shan, M Kan, S Wu, X Chen International journal of computer vision 130 (2), 344-362, 2022 | 14 | 2022 |
Onechart: Purify the chart structural extraction via one auxiliary token J Chen, L Kong, H Wei, C Liu, Z Ge, L Zhao, J Sun, C Han, X Zhang Proceedings of the 32nd ACM International Conference on Multimedia, 147-155, 2024 | 13 | 2024 |
Dreambench++: A human-aligned benchmark for personalized image generation Y Peng, Y Cui, H Tang, Z Qi, R Dong, J Bai, C Han, Z Ge, X Zhang, ST Xia arXiv preprint arXiv:2406.16855, 2024 | 13 | 2024 |
The 1st-place solution for cvpr 2023 openlane topology in autonomous driving challenge D Wu, F Jia, J Chang, Z Li, J Sun, C Han, S Li, Y Liu, Z Ge, T Wang arXiv preprint arXiv:2306.09590, 2023 | 11 | 2023 |
Grouplane: End-to-end 3d lane detection with channel-wise grouping Z Li, C Han, Z Ge, J Yang, E Yu, H Wang, X Zhang, H Zhao IEEE Robotics and Automation Letters, 2024 | 8 | 2024 |
Xiangwen Kong, Xiangyu Zhang, Kaisheng Ma, and Li Yi. 2024. Dream-LLM: Synergistic multimodal comprehension and creation R Dong, C Han, Y Peng, Z Qi, Z Ge, J Yang, L Zhao, J Sun, H Zhou, H Wei The Twelfth International Conference on Learning Representations. https …, 2023 | 7 | 2023 |
Triplet knowledge distillation X Wang, D Liu, M Kan, C Han, Z Wu, S Shan arXiv preprint arXiv:2305.15975, 2023 | 4 | 2023 |
SCSC: Spatial Cross-scale Convolution Module to Strengthen both CNNs and Transformers X Wang, X Chu, C Han, X Zhang Proceedings of the IEEE/CVF International Conference on Computer Vision, 731-741, 2023 | 1 | 2023 |
Corrections to “Learning to Learn Adaptive Classifier-Predictor for Few-Shot Learning”[Aug 21 3458-3470] N Lai, M Kan, C Han, X Song, S Shan IEEE Transactions on Neural Networks and Learning Systems 32 (8), 3784-3784, 2020 | 1 | 2020 |