Recent advances in 3D Gaussian splatting
The emergence of 3D Gaussian splatting (3DGS) has greatly accelerated rendering in novel
view synthesis. Unlike neural implicit representations like neural radiance fields (NeRFs) …
Adversarial diffusion distillation
We introduce Adversarial Diffusion Distillation (ADD), a novel training approach that
efficiently samples large-scale foundational image diffusion models in just 1–4 steps while …
Visual autoregressive modeling: Scalable image generation via next-scale prediction
We present Visual AutoRegressive modeling (VAR), a new generation paradigm
that redefines the autoregressive learning on images as coarse-to-fine "next-scale …
LaVie: High-quality video generation with cascaded latent diffusion models
This work aims to learn a high-quality text-to-video (T2V) generative model by leveraging a
pre-trained text-to-image (T2I) model as a basis. It is a highly desirable yet challenging task …
Diffusion model alignment using direct preference optimization
Large language models (LLMs) are fine-tuned using human comparison data with
Reinforcement Learning from Human Feedback (RLHF) methods to make them better …
Align your Gaussians: Text-to-4D with dynamic 3D Gaussians and composed diffusion models
Text-guided diffusion models have revolutionized image and video generation and have
also been successfully used for optimization-based 3D object synthesis. Here we instead …
DistriFusion: Distributed parallel inference for high-resolution diffusion models
Diffusion models have achieved great success in synthesizing high-quality images.
However, generating high-resolution images with diffusion models is still challenging due to …
Emu Edit: Precise image editing via recognition and generation tasks
Instruction-based image editing holds immense potential for a variety of applications as it
enables users to perform any editing operation using a natural language instruction …
Fast high-resolution image synthesis with latent adversarial diffusion distillation
Diffusion models are the main driver of progress in image and video synthesis, but suffer
from slow inference speed. Distillation methods, like the recently introduced adversarial …
Vista: A generalizable driving world model with high fidelity and versatile controllability
World models can foresee the outcomes of different actions, which is of paramount
importance for autonomous driving. Nevertheless, existing driving world models still have …