Modelscope text-to-video technical report J Wang, H Yuan, D Chen, Y Zhang, X Wang, S Zhang arXiv preprint arXiv:2308.06571, 2023 | 333 | 2023 |
Videofusion: Decomposed diffusion models for high-quality video generation Z Luo, D Chen, Y Zhang, Y Huang, L Wang, Y Shen, D Zhao, J Zhou, ... arXiv preprint arXiv:2303.08320, 2023 | 294 | 2023 |
Videocomposer: Compositional video synthesis with motion controllability X Wang, H Yuan, S Zhang, D Chen, J Wang, Y Zhang, Y Shen, D Zhao, ... Advances in Neural Information Processing Systems 36, 2024 | 285 | 2024 |
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding Z Li, J Zhang, Q Lin, J Xiong, Y Long, X Deng, Y Zhang, X Liu, M Huang, ... arXiv preprint arXiv:2405.08748, 2024 | 40 | 2024 |