State of the art on diffusion models for visual computing

R Po, W Yifan, V Golyanik, K Aberman… - Computer Graphics …, 2024 - Wiley Online Library
The field of visual computing is rapidly advancing due to the emergence of generative
artificial intelligence (AI), which unlocks unprecedented capabilities for the generation …

3D Gaussian splatting as new era: A survey

B Fei, J Xu, R Zhang, Q Zhou… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
3D Gaussian Splatting (3D-GS) has emerged as a significant advancement in the field of
Computer Graphics, offering explicit scene representation and novel view synthesis without …

Objaverse-XL: A universe of 10M+ 3D objects

M Deitke, R Liu, M Wallingford, H Ngo… - Advances in …, 2023 - proceedings.neurips.cc
Natural language processing and 2D vision models have attained remarkable proficiency on
many tasks primarily by escalating the scale of training data. However, 3D vision tasks have …

SV3D: Novel multi-view synthesis and 3D generation from a single image using latent video diffusion

V Voleti, CH Yao, M Boss, A Letts, D Pankratz… - … on Computer Vision, 2024 - Springer
We present Stable Video 3D (SV3D), a latent video diffusion model for high-resolution,
image-to-multi-view generation of orbital videos around a 3D object. Recent …

InstantMesh: Efficient 3D mesh generation from a single image with sparse-view large reconstruction models

J Xu, W Cheng, Y Gao, X Wang, S Gao… - arXiv preprint arXiv …, 2024 - arxiv.org
We present InstantMesh, a feed-forward framework for instant 3D mesh generation from a
single image, featuring state-of-the-art generation quality and significant training scalability …

RichDreamer: A generalizable normal-depth diffusion model for detail richness in text-to-3D

L Qiu, G Chen, X Gu, Q Zuo, M Xu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Lifting 2D diffusion for 3D generation is a challenging problem due to the lack of geometric
prior and the complex entanglement of materials and lighting in natural images. Existing …

CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets

L Zhang, Z Wang, Q Zhang, Q Qiu, A Pang… - ACM Transactions on …, 2024 - dl.acm.org
In the realm of digital creativity, our potential to craft intricate 3D worlds from imagination is
often hampered by the limitations of existing digital tools, which demand extensive expertise …

GeoWizard: Unleashing the diffusion priors for 3D geometry estimation from a single image

X Fu, W Yin, M Hu, K Wang, Y Ma, P Tan… - … on Computer Vision, 2024 - Springer
We introduce GeoWizard, a new generative foundation model designed for estimating
geometric attributes, e.g., depth and normals, from single images. While significant research …

TripoSR: Fast 3D object reconstruction from a single image

D Tochilkin, D Pankratz, Z Liu, Z Huang, A Letts… - arXiv preprint arXiv …, 2024 - arxiv.org
This technical report introduces TripoSR, a 3D reconstruction model leveraging transformer
architecture for fast feed-forward 3D generation, producing 3D mesh from a single image in …

MVDiffusion++: A dense high-resolution multi-view diffusion model for single or sparse-view 3D object reconstruction

S Tang, J Chen, D Wang, C Tang, F Zhang… - … on Computer Vision, 2024 - Springer
This paper presents a neural architecture MVDiffusion++ for 3D object reconstruction that
synthesizes dense and high-resolution views of an object given one or a few images without …