Diffumask: Synthesizing images with pixel-level annotations for semantic segmentation using diffusion models W Wu, Y Zhao, MZ Shou, H Zhou, C Shen Proc. Int. Conf. Computer Vision (ICCV 2023), 2023 | 158 | 2023 |
Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models Y Gu, X Wang, JZ Wu, Y Shi, Y Chen, Z Fan, W Xiao, R Zhao, S Chang, ... Proc. Advances In Neural Information Processing Systems (NeurIPS 2023), 2023 | 141 | 2023 |
DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models W Wu, Y Zhao, H Chen, Y Gu, R Zhao, Y He, H Zhou, MZ Shou, C Shen Proc. Advances In Neural Information Processing Systems (NeurIPS 2023), 2023 | 96 | 2023 |
Motiondirector: Motion customization of text-to-video diffusion models R Zhao, Y Gu, JZ Wu, DJ Zhang, J Liu, W Wu, J Keppo, MZ Shou The 18th European Conference on Computer Vision (ECCV 2024), 2023 | 90 | 2023 |
PTQD: Accurate Post-Training Quantization for Diffusion Models Y He, L Liu, J Liu, W Wu, H Zhou, B Zhuang Proc. Advances In Neural Information Processing Systems (NeurIPS 2023), 2023 | 84 | 2023 |
Bivit: Extremely compressed binary vision transformer Y He, Z Lou, L Zhang, W Wu, B Zhuang, H Zhou Proc. Int. Conf. Computer Vision (ICCV 2023), 2022 | 38 | 2022 |
Efficientclip: Efficient cross-modal pre-training by ensemble confident learning and language modeling J Wang, H Wang, J Deng, W Wu, D Zhang First Workshop on Pre-training: Perspectives, Pitfalls, and Paths Forward at …, 2021 | 37 | 2021 |
DragAnything: Motion Control for Anything using Entity Representation W Wu, Z Li, Y Gu, R Zhao, Y He, DJ Zhang, MZ Shou, Y Li, T Gao, ... The 18th European Conference on Computer Vision (ECCV 2024), 2024 | 35 | 2024 |
Efficientdm: Efficient quantization-aware fine-tuning of low-bit diffusion models Y He, J Liu, W Wu, H Zhou, B Zhuang The Twelfth International Conference on Learning Representations (ICLR 2024 …, 2023 | 35 | 2023 |
A bilingual, OpenWorld video text dataset and end-to-end video text spotter with transformer W Wu, Y Cai, D Zhang, S Wang, Z Li, J Li, Y Tang, H Zhou Proc. Advances In Neural Information Processing Systems (NeurIPS 2021), 2021 | 34 | 2021 |
Generative Prompt Model for Weakly Supervised Object Localization Y Zhao, Q Ye, W Wu, C Shen, F Wan Proc. Int. Conf. Computer Vision (ICCV 2023), 2023 | 29 | 2023 |
End-to-end video text spotting with transformer W Wu, Y Cai, C Shen, D Zhang, Y Fu, H Zhou, P Luo International Journal of Computer Vision (IJCV), 2022 | 26 | 2022 |
Synthetic-to-real unsupervised domain adaptation for scene text detection in the wild W Wu, N Lu, E Xie, Y Wang, W Yu, C Yang, H Zhou Proceedings of the Asian Conference on Computer Vision (ACCV 2020), 2020 | 25 | 2020 |
A large cross-modal video retrieval dataset with reading comprehension W Wu, Y Zhao, Z Li, J Li, H Zhou, MZ Shou, X Bai Pattern Recognition 157, 110818, 2025 | 20 | 2025 |
Paragraph-to-image generation with information-enriched diffusion model W Wu, Z Li, Y He, MZ Shou, C Shen, L Cheng, Y Li, T Gao, D Zhang, ... arXiv preprint arXiv:2311.14284, 2023 | 18 | 2023 |
Explore faster localization learning for scene text detection Y Zhao, Y Cai, W Wu, W Wang 2023 IEEE International Conference on Multimedia and Expo (ICME), 156-161, 2023 | 18 | 2023 |
Texts as lines: text detection with weak supervision W Wu, J Xing, C Yang, Y Wang, H Zhou Mathematical Problems in Engineering 2020 (1), 3871897, 2020 | 18 | 2020 |
ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification Y He, L Zhang, W Wu, J Liu, H Zhou, B Zhuang Proc. Advances In Neural Information Processing Systems (NeurIPS 2024), 2024 | 16 | 2024 |
Continual Learning for Image Segmentation with Dynamic Query W Wu, Y Zhao, Z Li, L Shan, H Zhou, MZ Shou IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2023 | 16 | 2023 |
Textcohesion: Detecting text for arbitrary shapes W Wu, J Xing, H Zhou Mathematical Problems in Engineering 2020, 2019 | 13 | 2019 |