Animatediff: Animate your personalized text-to-image diffusion models without specific tuning Y Guo, C Yang, A Rao, Z Liang, Y Wang, Y Qiao, M Agrawala, D Lin, ... International Conference on Learning Representations (ICLR) 2024, 2023 | 623 | 2023 |
Lavie: High-quality video generation with cascaded latent diffusion models Y Wang, X Chen, X Ma, S Zhou, Z Huang, Y Wang, C Yang, Y He, J Yu, ... International Journal of Computer Vision, 1-20, 2024 | 217 | 2024 |
Sparsectrl: Adding sparse controls to text-to-video diffusion models Y Guo, C Yang, A Rao, M Agrawala, D Lin, B Dai The 18th European Conference on Computer Vision (ECCV) 2024, 2023 | 76 | 2023 |
Cameractrl: Enabling camera control for text-to-video generation H He, Y Xu, Y Guo, G Wetzstein, B Dai, H Li, C Yang International Conference on Learning Representations (ICLR) 2025, 2024 | 66 | 2024 |
Dynamic storyboard generation in an engine-based virtual environment for video production A Rao, X Jiang, Y Guo, L Xu, L Yang, L Jin, D Lin, B Dai ACM SIGGRAPH 2023 Posters, 1-2, 2023 | 15 | 2023 |
Temporal and contextual transformer for multi-camera editing of TV shows A Rao, X Jiang, S Wang, Y Guo, Z Liu, B Dai, L Pang, X Wu, D Lin, L Jin arXiv preprint arXiv:2210.08737, 2022 | 8 | 2022 |
Humanvid: Demystifying training data for camera-controllable human image animation Z Wang, Y Li, Y Zeng, Y Fang, Y Guo, W Liu, J Tan, K Chen, T Xue, B Dai, ... arXiv preprint arXiv:2407.17438, 2024 | 5 | 2024 |
Sam2long: Enhancing sam 2 for long video segmentation with a training-free memory tree S Ding, R Qian, X Dong, P Zhang, Y Zang, Y Cao, Y Guo, D Lin, J Wang arXiv preprint arXiv:2410.16268, 2024 | 3 | 2024 |
Imagine360: Immersive 360 Video Generation from Perspective Anchor J Tan, S Yang, T Wu, J He, Y Guo, Z Liu, D Lin arXiv preprint arXiv:2412.03552, 2024 | | 2024 |
Generative Models for Visual Content Editing and Creation A Rao, Y Xiangli, Y Guo, M Tang, C Meng, M Agrawala ACM SIGGRAPH 2024 Courses, 1-6, 2024 | | 2024 |