State of the art on diffusion models for visual computing

R Po, W Yifan, V Golyanik, K Aberman… - Computer Graphics …, 2024 - Wiley Online Library
The field of visual computing is rapidly advancing due to the emergence of generative
artificial intelligence (AI), which unlocks unprecedented capabilities for the generation …

3d gaussian splatting as new era: A survey

B Fei, J Xu, R Zhang, Q Zhou… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
3D Gaussian Splatting (3D-GS) has emerged as a significant advancement in the field of
Computer Graphics, offering explicit scene representation and novel view synthesis without …

Lgm: Large multi-view gaussian model for high-resolution 3d content creation

J Tang, Z Chen, X Chen, T Wang, G Zeng… - European Conference on …, 2024 - Springer
Abstract 3D content creation has achieved significant progress in terms of both quality and
speed. Although current feed-forward models can produce 3D objects in seconds, their …

Splatter image: Ultra-fast single-view 3d reconstruction

S Szymanowicz, C Rupprecht… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract We introduce the Splatter Image an ultra-efficient approach for monocular 3D object
reconstruction. Splatter Image is based on Gaussian Splatting which allows fast and high …

Triplane meets gaussian splatting: Fast and generalizable single-view 3d reconstruction with transformers

ZX Zou, Z Yu, YC Guo, Y Li, D Liang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recent advancements in 3D reconstruction from single images have been driven by the
evolution of generative models. Prominent among these are methods based on Score …

Grm: Large gaussian reconstruction model for efficient 3d reconstruction and generation

Y Xu, Z Shi, W Yifan, H Chen, C Yang, S Peng… - … on Computer Vision, 2024 - Springer
We introduce GRM, a large-scale reconstructor capable of recovering a 3D asset from
sparse-view images in around 0.1 s. GRM is a feed-forward transformer-based model that …

Align your gaussians: Text-to-4d with dynamic 3d gaussians and composed diffusion models

H Ling, SW Kim, A Torralba… - Proceedings of the …, 2024 - openaccess.thecvf.com
Text-guided diffusion models have revolutionized image and video generation and have
also been successfully used for optimization-based 3D object synthesis. Here we instead …

Crm: Single image to 3d textured mesh with convolutional reconstruction model

Z Wang, Y Wang, Y Chen, C **ang, S Chen… - … on Computer Vision, 2024 - Springer
Feed-forward 3D generative models like the Large Reconstruction Model (LRM) have
demonstrated exceptional generation speed. However, the transformer-based methods do …

Instantmesh: Efficient 3d mesh generation from a single image with sparse-view large reconstruction models

J Xu, W Cheng, Y Gao, X Wang, S Gao… - arxiv preprint arxiv …, 2024 - arxiv.org
We present InstantMesh, a feed-forward framework for instant 3D mesh generation from a
single image, featuring state-of-the-art generation quality and significant training scalability …

CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets

L Zhang, Z Wang, Q Zhang, Q Qiu, A Pang… - ACM Transactions on …, 2024 - dl.acm.org
In the realm of digital creativity, our potential to craft intricate 3D worlds from imagination is
often hampered by the limitations of existing digital tools, which demand extensive expertise …