Visual ChatGPT: Talking, drawing and editing with visual foundation models C Wu, S Yin, W Qi, X Wang, Z Tang, N Duan arXiv preprint arXiv:2303.04671, 2023 | 687 | 2023 |
DragNUWA: Fine-grained control in video generation by integrating text, image, and trajectory S Yin, C Wu, J Liang, J Shi, H Li, G Ming, N Duan arXiv preprint arXiv:2308.08089, 2023 | 98 | 2023 |
NUWA-XL: Diffusion over diffusion for extremely long video generation S Yin, C Wu, H Yang, J Wang, X Wang, M Ni, Z Yang, L Li, S Liu, F Yang, ... arXiv preprint arXiv:2303.12346, 2023 | 97 | 2023 |
StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis Z Tang, C Wu, Z Zhang, M Ni, S Yin, Y Liu, Z Yang, L Wang, Z Liu, J Li, ... arXiv preprint arXiv:2401.17093, 2024 | 9 | 2024 |
ORES: Open-vocabulary Responsible Visual Synthesis M Ni, C Wu, X Wang, S Yin, L Wang, Z Liu, N Duan Proceedings of the AAAI Conference on Artificial Intelligence 38 (19), 21473 …, 2024 | 7 | 2024 |
EG4D: Explicit Generation of 4D Object without Score Distillation Q Sun, Z Guo, Z Wan, JN Yan, S Yin, W Zhou, J Liao, H Li arXiv preprint arXiv:2405.18132, 2024 | 6 | 2024 |
Using Left and Right Brains Together: Towards Vision and Language Planning J Cen, C Wu, X Liu, S Yin, Y Pei, J Yang, Q Chen, N Duan, J Zhang arXiv preprint arXiv:2402.10534, 2024 | 4 | 2024 |
Learning 3D photography videos via self-supervised diffusion on single images X Wang, C Wu, S Yin, M Ni, J Wang, L Li, Z Yang, F Yang, L Wang, Z Liu, ... arXiv preprint arXiv:2302.10781, 2023 | 4 | 2023 |