How people prompt generative ai to create interactive vr scenes

S Aghel Manesh, T Zhang, Y Onishi, K Hara… - Proceedings of the …, 2024 - dl.acm.org
Generative AI tools can provide people with the ability to create virtual environments and
scenes with natural language prompts. Yet, how people will formulate such prompts is …

Towards a science exocortex

KG Yager - Digital Discovery, 2024 - pubs.rsc.org
Artificial intelligence (AI) methods are poised to revolutionize intellectual work, with
generative AI enabling automation of text analysis, text generation, and simple decision …

Autoregressive Models in Vision: A Survey

J **ong, G Liu, L Huang, C Wu, T Wu, Y Mu… - arxiv preprint arxiv …, 2024 - arxiv.org
Autoregressive modeling has been a huge success in the field of natural language
processing (NLP). Recently, autoregressive models have emerged as a significant area of …

A Survey on Vision Autoregressive Model

K Jiang, J Huang - arxiv preprint arxiv:2411.08666, 2024 - arxiv.org
Autoregressive models have demonstrated great performance in natural language
processing (NLP) with impressive scalability, adaptability and generalizability. Inspired by …

Reconstructing Animals and the Wild

P Kulits, MJ Black, S Zuffi - arxiv preprint arxiv:2411.18807, 2024 - arxiv.org
The idea of 3D reconstruction as scene understanding is foundational in computer vision.
Reconstructing 3D scenes from 2D visual observations requires strong priors to …

Synthesizing 3D Abstractions by Inverting Procedural Buildings with Transformers

M Dax, J Berbel, J Stria, L Guibas… - arxiv preprint arxiv …, 2025 - arxiv.org
We generate abstractions of buildings, reflecting the essential aspects of their geometry and
structure, by learning to invert procedural models. We first build a dataset of abstract …

DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation

W Zhao, YP Cao, J Xu, Y Dong, Y Shan - arxiv preprint arxiv:2412.15200, 2024 - arxiv.org
Procedural Content Generation (PCG) is powerful in creating high-quality 3D contents, yet
controlling it to produce desired shapes is difficult and often requires extensive parameter …

[HTML][HTML] Near Real-Time 3D Reconstruction of Construction Sites Based on Surveillance Cameras

A Sun, X An, P Li, M Lv, W Liu - Buildings, 2025 - mdpi.com
The 3D reconstruction of construction sites is of great importance for construction progress,
quality, and safety management. Currently, most of the existing 3D reconstruction methods …

LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models

FY Sun, W Liu, S Gu, D Lim, G Bhat, F Tombari… - arxiv preprint arxiv …, 2024 - arxiv.org
Open-universe 3D layout generation arranges unlabeled 3D assets conditioned on
language instruction. Large language models (LLMs) struggle with generating physically …

Prism: Semi-Supervised Multi-View Stereo with Monocular Structure Priors

A Rich, N Stier, P Sen, T Höllerer - arxiv preprint arxiv:2412.05771, 2024 - arxiv.org
The promise of unsupervised multi-view-stereo (MVS) is to leverage large unlabeled
datasets, yet current methods underperform when training on difficult data, such as …