A survey on video diffusion models
The recent wave of AI-generated content (AIGC) has witnessed substantial success in
computer vision, with the diffusion model playing a crucial role in this achievement. Due to …
Lavie: High-quality video generation with cascaded latent diffusion models
This work aims to learn a high-quality text-to-video (T2V) generative model by leveraging a
pre-trained text-to-image (T2I) model as a basis. It is a highly desirable yet challenging task …
Vbench: Comprehensive benchmark suite for video generative models
Video generation has witnessed significant advancements, yet evaluating these models
remains a challenge. A comprehensive evaluation benchmark for video generation is …
GauHuman: Articulated Gaussian splatting from monocular human videos
We present GauHuman, a 3D human model with Gaussian Splatting for both fast training (1-2
minutes) and real-time rendering (up to 189 FPS) compared with existing NeRF-based …
Freeinit: Bridging initialization gap in video diffusion models
Though diffusion-based video generation has witnessed rapid progress, the inference
results of existing models still exhibit unsatisfactory temporal consistency and unnatural …
Videobooth: Diffusion-based video generation with image prompts
Text-driven video generation witnesses rapid progress. However, merely using text prompts
is not enough to depict the desired subject appearance that accurately aligns with users' …
Id-animator: Zero-shot identity-preserving human video generation
Generating high-fidelity human video with specified identities has attracted significant
attention in the content generation community. However, existing techniques struggle to …
Disco: Disentangled control for realistic human dance generation
Generative AI has made significant strides in computer vision, particularly in text-driven
image/video synthesis (T2I/T2V). Despite the notable advancements, it remains challenging …
Appearance and Pose-guided Human Generation: A Survey
F Liao, X Zou, W Wong - ACM Computing Surveys, 2024 - dl.acm.org
Appearance and pose-guided human generation is a burgeoning field that has captured
significant attention. This subject's primary objective is to transfer pose information from a …
Compositional abilities emerge multiplicatively: Exploring diffusion models on a synthetic task
Modern generative models exhibit unprecedented capabilities to generate extremely
realistic data. However, given the inherent compositionality of the real world, reliable use of …