A comprehensive survey on 3D content generation

J Liu, X Huang, T Huang, L Chen, Y Hou… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent years have witnessed remarkable advances in artificial intelligence generated
content (AIGC), with diverse input modalities, eg, text, image, video, audio and 3D. The 3D is …

Ctrl-room: Controllable text-to-3d room meshes generation with layout constraints

C Fang, Y Dong, K Luo, X Hu, R Shrestha… - arxiv preprint arxiv …, 2023 - arxiv.org
Text-driven 3D indoor scene generation is useful for gaming, the film industry, and AR/VR
applications. However, existing methods cannot faithfully capture the room layout, nor do …

Blockfusion: Expandable 3d scene generation using latent tri-plane extrapolation

Z Wu, Y Li, H Yan, T Shang, W Sun, S Wang… - ACM Transactions on …, 2024 - dl.acm.org
We present BlockFusion, a diffusion-based model that generates 3D scenes as unit blocks
and seamlessly incorporates new blocks to extend the scene. BlockFusion is trained using …

Frankenstein: Generating semantic-compositional 3d scenes in one tri-plane

H Yan, Y Li, Z Wu, S Chen, W Sun, T Shang… - SIGGRAPH Asia 2024 …, 2024 - dl.acm.org
We present Frankenstein, a diffusion-based framework that can generate semantic-
compositional 3D scenes in a single pass. Unlike existing methods that output a single …

Streetscapes: Large-scale consistent street view generation using autoregressive video diffusion

B Deng, R Tucker, Z Li, L Guibas, N Snavely… - ACM SIGGRAPH 2024 …, 2024 - dl.acm.org
We present a method for generating Streetscapes—long sequences of views through an on-
the-fly synthesized city-scale scene. Our generation is conditioned by language input (eg …

Spaceblender: Creating context-rich collaborative spaces through generative 3d scene blending

N Numan, S Rajaram, BT Kumaravel… - Proceedings of the 37th …, 2024 - dl.acm.org
There is increased interest in using generative AI to create 3D spaces for Virtual Reality (VR)
applications. However, today's models produce artificial environments, falling short of …

Lexicon3d: Probing visual foundation models for complex 3d scene understanding

Y Man, S Zheng, Z Bao, M Hebert, LY Gui… - arxiv preprint arxiv …, 2024 - arxiv.org
Complex 3D scene understanding has gained increasing attention, with scene encoding
strategies playing a crucial role in this success. However, the optimal scene encoding …

Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting

Y Wang, X Qiu, J Liu, Z Chen, J Cai… - Advances in …, 2025 - proceedings.neurips.cc
Creating large-scale interactive 3D environments is essential for the development of
Robotics and Embodied AI research. However, generating diverse embodied environments …

Open-Universe Indoor Scene Generation using LLM Program Synthesis and Uncurated Object Databases

R Aguina-Kang, M Gumin, DH Han, S Morris… - arxiv preprint arxiv …, 2024 - arxiv.org
We present a system for generating indoor scenes in response to text prompts. The prompts
are not limited to a fixed vocabulary of scene descriptions, and the objects in generated …

BlobGEN-3D: Compositional 3D-Consistent Freeview Image Generation with 3D Blobs

C Liu, W Nie, S Liu, A Badki, H Su, M Mardani… - SIGGRAPH Asia 2024 …, 2024 - dl.acm.org
Recent advances in text-to-image diffusion models have significantly enhanced image
generation quality, when trained on internet-scale data. However, existing methods are …