Grm: Large gaussian reconstruction model for efficient 3d reconstruction and generation
We introduce GRM, a large-scale reconstructor capable of recovering a 3D asset from
sparse-view images in around 0.1 s. GRM is a feed-forward transformer-based model that …
sparse-view images in around 0.1 s. GRM is a feed-forward transformer-based model that …
Triplane meets gaussian splatting: Fast and generalizable single-view 3d reconstruction with transformers
Recent advancements in 3D reconstruction from single images have been driven by the
evolution of generative models. Prominent among these are methods based on Score …
evolution of generative models. Prominent among these are methods based on Score …
Street gaussians: Modeling dynamic urban scenes with gaussian splatting
This paper aims to tackle the problem of modeling dynamic urban streets for autonomous
driving scenes. Recent methods extend NeRF by incorporating tracked vehicle poses to …
driving scenes. Recent methods extend NeRF by incorporating tracked vehicle poses to …
Dmv3d: Denoising multi-view diffusion using 3d large reconstruction model
We propose\textbf {DMV3D}, a novel 3D generation approach that uses a transformer-based
3D large reconstruction model to denoise multi-view diffusion. Our reconstruction model …
3D large reconstruction model to denoise multi-view diffusion. Our reconstruction model …
Cad: Photorealistic 3d generation via adversarial distillation
The increased demand for 3D data in AR/VR robotics and gaming applications gave rise to
powerful generative pipelines capable of synthesizing high-quality 3D objects. Most of these …
powerful generative pipelines capable of synthesizing high-quality 3D objects. Most of these …
Recent advances in implicit representation-based 3d shape generation
Various techniques have been developed and introduced to address the pressing need to
create three-dimensional (3D) content for advanced applications such as virtual reality and …
create three-dimensional (3D) content for advanced applications such as virtual reality and …
Gen2sim: Scaling up robot learning in simulation with generative models
Generalist robot manipulators need to learn a wide variety of manipulation skills across
diverse environments. Current robot training pipelines rely on humans to provide kinesthetic …
diverse environments. Current robot training pipelines rely on humans to provide kinesthetic …
Streetscapes: Large-scale consistent street view generation using autoregressive video diffusion
We present a method for generating Streetscapes—long sequences of views through an on-
the-fly synthesized city-scale scene. Our generation is conditioned by language input (eg …
the-fly synthesized city-scale scene. Our generation is conditioned by language input (eg …
NViST: In the Wild New View Synthesis from a Single Image with Transformers
We propose NViST a transformer-based model for efficient and generalizable novel-view
synthesis from a single image for real-world scenes. In contrast to many methods that are …
synthesis from a single image for real-world scenes. In contrast to many methods that are …
Computational Sensing, Understanding, and Reasoning: An Artificial Intelligence Approach to Physics-Informed World Modeling
This work offers a discussion on how computational mechanics and physics-informed
machine learning can be integrated into the process of sensing, understanding, and …
machine learning can be integrated into the process of sensing, understanding, and …