Scale-aware modulation meet transformer W Lin, Z Wu, J Chen, J Huang, L Jin Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 98 | 2023 |
Sphinx-x: Scaling data and parameters for a family of multi-modal large language models D Liu, R Zhang, L Qiu, S Huang, W Lin, S Zhao, S Geng, Z Lin, P Jin, ... arXiv preprint arXiv:2402.05935, 2024 | 94* | 2024 |
Lumina-mgpt: Illuminate flexible photorealistic text-to-image generation with multimodal generative pretraining D Liu, S Zhao, L Zhuo, W Lin, Y Qiao, H Li, P Gao arXiv preprint arXiv:2408.02657, 2024 | 26 | 2024 |
Draw-and-understand: Leveraging visual prompts to enable mllms to comprehend what you want W Lin, ... 2024 | 22* | 2024 |
Hierarchical side-tuning for vision transformers W Lin, Z Wu, W Yang, M Huang, J Huang, L Jin arXiv preprint arXiv:2310.05393, 2023 | 10 | 2023 |
M2SD: Multiple Mixing Self-Distillation for Few-Shot Class-Incremental Learning J Lin, Z Wu, W Lin, J Huang, RH Luo Proceedings of the AAAI Conference on Artificial Intelligence 38 (4), 3422-3431, 2024 | 6 | 2024 |
Rapid diffusion: Building domain-specific text-to-image synthesizers with fast inference speed B Liu, W Lin, Z Duan, C Wang, W Ziheng, Z Zipeng, K Jia, L Jin, C Chen, ... Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023 | 4 | 2023 |
PixWizard: Versatile image-to-image visual assistant with open-language instructions W Lin, X Wei, R Zhang, L Zhuo, S Zhao, S Huang, J Xie, Y Qiao, P Gao, ... arXiv preprint arXiv:2409.15278, 2024 | 3 | 2024 |
Building A Mobile Text Recognizer via Truncated SVD-based Knowledge Distillation-Guided NAS. W Lin, C Xie, D Peng, J Wang, L Jin, W Ding, C Yao, M He BMVC, 375, 2023 | 1 | 2023 |
IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models J Lei, R Zhang, X Hu, W Lin, Z Li, W Sun, R Du, L Zhuo, Z Li, X Li, S Zhao, ... arXiv preprint arXiv:2501.13920, 2025 | | 2025 |
LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects W Liu, L Liu, Y Guo, H Xiao, W Lin, Y Chai, S Ren, X Liang, L Li, W Wang, ... Preprints, 2025 | | 2025 |