A survey on video diffusion models
The recent wave of AI-generated content (AIGC) has witnessed substantial success in
computer vision, with the diffusion model playing a crucial role in this achievement. Due to …
computer vision, with the diffusion model playing a crucial role in this achievement. Due to …
Pros and cons of GAN evaluation measures: New developments
A Borji - Computer Vision and Image Understanding, 2022 - Elsevier
This work is an update of my previous paper on the same topic published a few years ago
(Borji, 2019). With the dramatic progress in generative modeling, a suite of new quantitative …
(Borji, 2019). With the dramatic progress in generative modeling, a suite of new quantitative …
Imagen video: High definition video generation with diffusion models
We present Imagen Video, a text-conditional video generation system based on a cascade
of video diffusion models. Given a text prompt, Imagen Video generates high definition …
of video diffusion models. Given a text prompt, Imagen Video generates high definition …
Dynamicrafter: Animating open-domain images with video diffusion priors
Animating a still image offers an engaging visual experience. Traditional image animation
techniques mainly focus on animating natural scenes with stochastic dynamics (eg clouds …
techniques mainly focus on animating natural scenes with stochastic dynamics (eg clouds …
Vbench: Comprehensive benchmark suite for video generative models
Video generation has witnessed significant advancements yet evaluating these models
remains a challenge. A comprehensive evaluation benchmark for video generation is …
remains a challenge. A comprehensive evaluation benchmark for video generation is …
Diffusion probabilistic modeling for video generation
Denoising diffusion probabilistic models are a promising new class of generative models
that mark a milestone in high-quality image generation. This paper showcases their ability to …
that mark a milestone in high-quality image generation. This paper showcases their ability to …
Speech gesture generation from the trimodal context of text, audio, and speaker identity
For human-like agents, including virtual avatars and social robots, making proper gestures
while speaking is crucial in human-agent interaction. Co-speech gestures enhance …
while speaking is crucial in human-agent interaction. Co-speech gestures enhance …
Zigma: A dit-style zigzag mamba diffusion model
The diffusion model has long been plagued by scalability and quadratic complexity issues,
especially within transformer-based structures. In this study, we aim to leverage the long …
especially within transformer-based structures. In this study, we aim to leverage the long …
Tooncrafter: Generative cartoon interpolation
We introduce ToonCrafter, a novel approach that transcends traditional correspondence-
based cartoon video interpolation, paving the way for generative interpolation. Traditional …
based cartoon video interpolation, paving the way for generative interpolation. Traditional …
Fetv: A benchmark for fine-grained evaluation of open-domain text-to-video generation
Recently, open-domain text-to-video (T2V) generation models have made remarkable
progress. However, the promising results are mainly shown by the qualitative cases of …
progress. However, the promising results are mainly shown by the qualitative cases of …