Grm: Large gaussian reconstruction model for efficient 3d reconstruction and generation

Y Xu, Z Shi, W Yifan, H Chen, C Yang, S Peng… - … on Computer Vision, 2024 - Springer
We introduce GRM, a large-scale reconstructor capable of recovering a 3D asset from
sparse-view images in around 0.1 s. GRM is a feed-forward transformer-based model that …

Triplane meets gaussian splatting: Fast and generalizable single-view 3d reconstruction with transformers

ZX Zou, Z Yu, YC Guo, Y Li, D Liang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recent advancements in 3D reconstruction from single images have been driven by the
evolution of generative models. Prominent among these are methods based on Score …

Street gaussians: Modeling dynamic urban scenes with gaussian splatting

Y Yan, H Lin, C Zhou, W Wang, H Sun, K Zhan… - … on Computer Vision, 2024 - Springer
This paper aims to tackle the problem of modeling dynamic urban streets for autonomous
driving scenes. Recent methods extend NeRF by incorporating tracked vehicle poses to …

Dmv3d: Denoising multi-view diffusion using 3d large reconstruction model

Y Xu, H Tan, F Luan, S Bi, P Wang, J Li, Z Shi… - arxiv preprint arxiv …, 2023 - arxiv.org
We propose\textbf {DMV3D}, a novel 3D generation approach that uses a transformer-based
3D large reconstruction model to denoise multi-view diffusion. Our reconstruction model …

Cad: Photorealistic 3d generation via adversarial distillation

Z Wan, D Paschalidou, I Huang, H Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com
The increased demand for 3D data in AR/VR robotics and gaming applications gave rise to
powerful generative pipelines capable of synthesizing high-quality 3D objects. Most of these …

Recent advances in implicit representation-based 3d shape generation

JM Sun, T Wu, L Gao - Visual Intelligence, 2024 - Springer
Various techniques have been developed and introduced to address the pressing need to
create three-dimensional (3D) content for advanced applications such as virtual reality and …

Gen2sim: Scaling up robot learning in simulation with generative models

P Katara, Z **an, K Fragkiadaki - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
Generalist robot manipulators need to learn a wide variety of manipulation skills across
diverse environments. Current robot training pipelines rely on humans to provide kinesthetic …

Streetscapes: Large-scale consistent street view generation using autoregressive video diffusion

B Deng, R Tucker, Z Li, L Guibas, N Snavely… - ACM SIGGRAPH 2024 …, 2024 - dl.acm.org
We present a method for generating Streetscapes—long sequences of views through an on-
the-fly synthesized city-scale scene. Our generation is conditioned by language input (eg …

NViST: In the Wild New View Synthesis from a Single Image with Transformers

W Jang, L Agapito - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
We propose NViST a transformer-based model for efficient and generalizable novel-view
synthesis from a single image for real-world scenes. In contrast to many methods that are …

Computational Sensing, Understanding, and Reasoning: An Artificial Intelligence Approach to Physics-Informed World Modeling

B Moya, A Badías, D González, F Chinesta… - … Methods in Engineering, 2024 - Springer
This work offers a discussion on how computational mechanics and physics-informed
machine learning can be integrated into the process of sensing, understanding, and …