Multimodal image synthesis and editing: A survey and taxonomy
As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …
among multimodal information plays a key role for the creation and perception of multimodal …
Syncdreamer: Generating multiview-consistent images from a single-view image
In this paper, we present a novel diffusion model called that generates multiview-consistent
images from a single-view image. Using pretrained large-scale 2D diffusion models, recent …
images from a single-view image. Using pretrained large-scale 2D diffusion models, recent …
Grm: Large gaussian reconstruction model for efficient 3d reconstruction and generation
We introduce GRM, a large-scale reconstructor capable of recovering a 3D asset from
sparse-view images in around 0.1 s. GRM is a feed-forward transformer-based model that …
sparse-view images in around 0.1 s. GRM is a feed-forward transformer-based model that …
Sdfusion: Multimodal 3d shape completion, reconstruction, and generation
In this work, we present a novel framework built to simplify 3D asset generation for amateur
users. To enable interactive generation, our method supports a variety of input modalities …
users. To enable interactive generation, our method supports a variety of input modalities …
A survey on deep generative 3d-aware image synthesis
Recent years have seen remarkable progress in deep learning powered visual content
creation. This includes deep generative 3D-aware image synthesis, which produces high …
creation. This includes deep generative 3D-aware image synthesis, which produces high …
Gaussian shell maps for efficient 3d human generation
Efficient generation of 3D digital humans is important in several industries including virtual
reality social media and cinematic production. 3D generative adversarial networks (GANs) …
reality social media and cinematic production. 3D generative adversarial networks (GANs) …
Autodecoding latent 3d diffusion models
Diffusion-based methods have shown impressive visual results in the text-to-image domain.
They first learn a latent space using an autoencoder, then run a denoising process on the …
They first learn a latent space using an autoencoder, then run a denoising process on the …
3d-aware image generation using 2d diffusion models
In this paper, we introduce a novel 3D-aware image generation method that leverages 2D
diffusion models. We formulate the 3D-aware image generation task as multiview 2D image …
diffusion models. We formulate the 3D-aware image generation task as multiview 2D image …
Dmv3d: Denoising multi-view diffusion using 3d large reconstruction model
We propose\textbf {DMV3D}, a novel 3D generation approach that uses a transformer-based
3D large reconstruction model to denoise multi-view diffusion. Our reconstruction model …
3D large reconstruction model to denoise multi-view diffusion. Our reconstruction model …
Text2tex: Text-driven texture synthesis via diffusion models
Abstract We present Text2Tex, a novel method for generating high-quality textures for 3D
meshes from the given text prompts. Our method incorporates inpainting into a pre-trained …
meshes from the given text prompts. Our method incorporates inpainting into a pre-trained …