CRM: Single image to 3D textured mesh with convolutional reconstruction model
Feed-forward 3D generative models like the Large Reconstruction Model (LRM) have
demonstrated exceptional generation speed. However, the transformer-based methods do …
TC4D: Trajectory-conditioned text-to-4D generation
Recent techniques for text-to-4D generation synthesize dynamic 3D scenes using
supervision from pre-trained text-to-video models. However, existing representations, such …
UniDream: Unifying diffusion priors for relightable text-to-3D generation
Recent advancements in text-to-3D generation technology have significantly advanced the
conversion of textual descriptions into imaginative well-geometrical and finely textured 3D …
VD3D: Taming large video diffusion transformers for 3D camera control
Modern text-to-video synthesis models demonstrate coherent, photorealistic generation of
complex videos from a text description. However, most existing models lack fine-grained …
InstantMesh: Efficient 3D mesh generation from a single image with sparse-view large reconstruction models
We present InstantMesh, a feed-forward framework for instant 3D mesh generation from a
single image, featuring state-of-the-art generation quality and significant training scalability …
SV4D: Dynamic 3D content generation with multi-frame and multi-view consistency
We present Stable Video 4D (SV4D), a latent video diffusion model for multi-frame and multi-
view consistent dynamic 3D content generation. Unlike previous methods that rely on …
Learning-based multi-view stereo: a survey
3D reconstruction aims to recover the dense 3D structure of a scene. It plays an essential
role in various applications such as Augmented/Virtual Reality (AR/VR), autonomous driving …
ScaleDreamer: Scalable text-to-3D synthesis with asynchronous score distillation
By leveraging the text-to-image diffusion prior, score distillation can synthesize 3D contents
without paired text-3D training data. Instead of spending hours of online optimization per text …
IM-3D: Iterative multiview diffusion and reconstruction for high-quality 3D generation
Most text-to-3D generators build upon off-the-shelf text-to-image models trained on billions
of images. They use variants of Score Distillation Sampling (SDS), which is slow, somewhat …
SCube: Instant large-scale scene reconstruction using VoxSplats
We present SCube, a novel method for reconstructing large-scale 3D scenes (geometry,
appearance, and semantics) from a sparse set of posed images. Our method encodes …