Diffusion models: A comprehensive survey of methods and applications
Diffusion models have emerged as a powerful new family of deep generative models with
record-breaking performance in many applications, including image synthesis, video …
record-breaking performance in many applications, including image synthesis, video …
ChatGPT is not all you need. A State of the Art Review of large Generative AI models
During the last two years there has been a plethora of large generative models such as
ChatGPT or Stable Diffusion that have been published. Concretely, these models are able to …
ChatGPT or Stable Diffusion that have been published. Concretely, these models are able to …
Prolificdreamer: High-fidelity and diverse text-to-3d generation with variational score distillation
Score distillation sampling (SDS) has shown great promise in text-to-3D generation by
distilling pretrained large-scale text-to-image diffusion models, but suffers from over …
distilling pretrained large-scale text-to-image diffusion models, but suffers from over …
Zero-1-to-3: Zero-shot one image to 3d object
Abstract We introduce Zero-1-to-3, a framework for changing the camera viewpoint of an
object given just a single RGB image. To perform novel view synthesis in this …
object given just a single RGB image. To perform novel view synthesis in this …
One-2-3-45: Any single image to 3d mesh in 45 seconds without per-shape optimization
Single image 3D reconstruction is an important but challenging task that requires extensive
knowledge of our natural world. Many existing methods solve this problem by optimizing a …
knowledge of our natural world. Many existing methods solve this problem by optimizing a …
Fantasia3d: Disentangling geometry and appearance for high-quality text-to-3d content creation
Automatic 3D content creation has achieved rapid progress recently due to the availability of
pre-trained, large language models and image diffusion models, forming the emerging topic …
pre-trained, large language models and image diffusion models, forming the emerging topic …
Hexplane: A fast representation for dynamic scenes
Modeling and re-rendering dynamic 3D scenes is a challenging task in 3D vision. Prior
approaches build on NeRF and rely on implicit representations. This is slow since it requires …
approaches build on NeRF and rely on implicit representations. This is slow since it requires …
Instruct-nerf2nerf: Editing 3d scenes with instructions
We propose a method for editing NeRF scenes with text-instructions. Given a NeRF of a
scene and the collection of images used to reconstruct it, our method uses an image …
scene and the collection of images used to reconstruct it, our method uses an image …
Objaverse-xl: A universe of 10m+ 3d objects
Natural language processing and 2D vision models have attained remarkable proficiency on
many tasks primarily by escalating the scale of training data. However, 3D vision tasks have …
many tasks primarily by escalating the scale of training data. However, 3D vision tasks have …
Emergent correspondence from image diffusion
Finding correspondences between images is a fundamental problem in computer vision. In
this paper, we show that correspondence emerges in image diffusion models without any …
this paper, we show that correspondence emerges in image diffusion models without any …