3d gaussian splatting: Survey, technologies, challenges, and opportunities

Y Bao, T Ding, J Huo, Y Liu, Y Li, W Li… - IEEE Transactions on …, 2025 - ieeexplore.ieee.org
3D Gaussian Splatting (3DGS) has emerged as a prominent technique with the potential to
become a mainstream method for 3D representations. It can effectively transform multi-view …

Generative ai meets 3d: A survey on text-to-3d in aigc era

C Li, C Zhang, J Cho, A Waghwase, LH Lee… - arxiv preprint arxiv …, 2023 - arxiv.org
Generative AI has made significant progress in recent years, with text-guided content
generation being the most practical as it facilitates interaction between human instructions …

Dreamscene4d: Dynamic multi-object scene generation from monocular videos

WH Chu, L Ke, K Fragkiadaki - arxiv preprint arxiv:2405.02280, 2024 - arxiv.org
View-predictive generative models provide strong priors for lifting object-centric images and
videos into 3D and 4D through rendering and score distillation objectives. A question then …

Vd3d: Taming large video diffusion transformers for 3d camera control

S Bahmani, I Skorokhodov, A Siarohin… - arxiv preprint arxiv …, 2024 - arxiv.org
Modern text-to-video synthesis models demonstrate coherent, photorealistic generation of
complex videos from a text description. However, most existing models lack fine-grained …

Vivid-zoo: Multi-view video generation with diffusion model

B Li, C Zheng, W Zhu, J Mai, B Zhang… - Advances in …, 2025 - proceedings.neurips.cc
While diffusion models have shown impressive performance in 2D image/video generation,
diffusion-based Text-to-Multi-view-Video (T2MVid) generation remains underexplored. The …

Eg4d: Explicit generation of 4d object without score distillation

Q Sun, Z Guo, Z Wan, JN Yan, S Yin, W Zhou… - arxiv preprint arxiv …, 2024 - arxiv.org
In recent years, the increasing demand for dynamic 3D assets in design and gaming
applications has given rise to powerful generative pipelines capable of synthesizing high …

Compositional 3d-aware video generation with llm director

H Zhu, T He, A Tang, J Guo, Z Chen, J Bian - arxiv preprint arxiv …, 2024 - arxiv.org
Significant progress has been made in text-to-video generation through the use of powerful
generative models and large-scale internet data. However, substantial challenges remain in …

Elastogen: 4d generative elastodynamics

Y Feng, Y Shang, X Feng, L Lan, S Zhe, T Shao… - arxiv preprint arxiv …, 2024 - arxiv.org
We present ElastoGen, a knowledge-driven AI model that generates physically accurate 4D
elastodynamics. Unlike deep models that learn from video-or image-based observations …

Animate3d: Animating any 3d model with multi-view video diffusion

Y Jiang, C Yu, C Cao, F Wang, W Hu, J Gao - arxiv preprint arxiv …, 2024 - arxiv.org
Recent advances in 4D generation mainly focus on generating 4D content by distilling pre-
trained text or single-view image-conditioned models. It is inconvenient for them to take …

Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis

B Zeng, L Yang, S Li, J Liu, Z Zhang, J Tian… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent advances in diffusion models have demonstrated exceptional capabilities in image
and video generation, further improving the effectiveness of 4D synthesis. Existing 4D …