Vbench: Comprehensive benchmark suite for video generative models

Z Huang, Y He, J Yu, F Zhang, C Si… - Proceedings of the …, 2024 - openaccess.thecvf.com
Video generation has witnessed significant advancements, yet evaluating these models
remains a challenge. A comprehensive evaluation benchmark for video generation is …

Videobooth: Diffusion-based video generation with image prompts

Y Jiang, T Wu, S Yang, C Si, D Lin… - Proceedings of the …, 2024 - openaccess.thecvf.com
Text-driven video generation witnesses rapid progress. However, merely using text prompts
is not enough to depict the desired subject appearance that accurately aligns with users' …

Vbench++: Comprehensive and versatile benchmark suite for video generative models

Z Huang, F Zhang, X Xu, Y He, J Yu, Z Dong… - ar…

… with Displacement Vectors

R Nishikawa, C Gu, H Takahashi, S Kuriyama - IEEE Access, 2024 - ieeexplore.ieee.org
Recent deep learning techniques have enabled simple methods of image editing through
textual instructions; however, it remains challenging to perform specific and quantitative …

Lodge++: High-quality and Long Dance Generation with Vivid Choreography Patterns

R Li, H Zhang, Y Zhang, Y Zhang, Y Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
We propose Lodge++, a choreography framework to generate high-quality, ultra-long, and
vivid dances given the music and desired genre. To handle the challenges in computational …

Controllable image and video synthesis

Y Jiang - 2024 - dr.ntu.edu.sg
Generative models have witnessed remarkable progress in recent years, significantly
improving the quality of synthesized images and videos. This thesis extends these …