Scale-aware modulation meet transformer W Lin, Z Wu, J Chen, J Huang, L Jin Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 100 | 2023 |
Beautifulprompt: Towards automatic prompt engineering for text-to-image synthesis T Cao, C Wang, B Liu, Z Wu, J Zhu, J Huang arXiv preprint arXiv:2311.06752, 2023 | 22 | 2023 |
Hierarchical side-tuning for vision transformers W Lin, Z Wu, W Yang, M Huang, J Huang, L Jin arXiv preprint arXiv:2310.05393, 2023 | 10 | 2023 |
YOLOX-PAI: an improved YOLOX, stronger and faster than YOLOv6 Z Wu, X Zou, W Zhou, J Huang arXiv preprint arXiv:2208.13040, 2022 | 9 | 2022 |
Facechain: A playground for identity-preserving portrait generation Y Liu, C Yu, L Shang, Z Wu, X Wang, Y Zhao, L Zhu, C Cheng, W Chen, ... arXiv preprint arXiv:2308.14256, 2023 | 7 | 2023 |
M2SD: Multiple Mixing Self-Distillation for Few-Shot Class-Incremental Learning J Lin, Z Wu, W Lin, J Huang, RH Luo Proceedings of the AAAI Conference on Artificial Intelligence 38 (4), 3422-3431, 2024 | 6 | 2024 |
Elastic-link for binarized neural networks J Hu, Z Wu, V Tan, Z Lu, M Zeng, E Wu Proceedings of the AAAI Conference on Artificial Intelligence 36 (1), 942-950, 2022 | 6 | 2022 |
DualToken-ViT: Position-aware Efficient Vision Transformer with Dual Token Fusion Z Chu, J Chen, C Chen, C Wang, Z Wu, J Huang, W Qian Proceedings of the 2024 SIAM International Conference on Data Mining (SDM …, 2024 | 4 | 2024 |
EasyPhoto: your smart AI photo generator Z Wu, J Xu, X Zou, K Huang, X Shi, J Huang arXiv preprint arXiv:2310.04672, 2023 | 4 | 2023 |
Rapid diffusion: Building domain-specific text-to-image synthesizers with fast inference speed B Liu, W Lin, Z Duan, C Wang, W Ziheng, Z Zipeng, K Jia, L Jin, C Chen, ... Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023 | 4 | 2023 |
Diffsynth: Latent in-iteration deflickering for realistic video synthesis Z Duan, L You, C Wang, C Chen, Z Wu, W Qian, J Huang Joint European Conference on Machine Learning and Knowledge Discovery in …, 2024 | 2 | 2024 |
Valley2: Exploring Multimodal Models with Scalable Vision-Language Design Z Wu, Z Chen, R Luo, C Zhang, Y Gao, Z He, X Wang, H Lin, M Qiu arXiv preprint arXiv:2501.05901, 2025 | | 2025 |