3d gaussian splatting: Survey, technologies, challenges, and opportunities

Y Bao, T Ding, J Huo, Y Liu, Y Li, W Li… - IEEE Transactions on …, 2025 - ieeexplore.ieee.org
3D Gaussian Splatting (3DGS) has emerged as a prominent technique with the potential to
become a mainstream method for 3D representations. It can effectively transform multi-view …

Generative ai meets 3d: A survey on text-to-3d in aigc era

C Li, C Zhang, J Cho, A Waghwase, LH Lee… - arxiv preprint arxiv …, 2023 - arxiv.org
Generative AI has made significant progress in recent years, with text-guided content
generation being the most practical as it facilitates interaction between human instructions …

Vd3d: Taming large video diffusion transformers for 3d camera control

S Bahmani, I Skorokhodov, A Siarohin… - arxiv preprint arxiv …, 2024 - arxiv.org
Modern text-to-video synthesis models demonstrate coherent, photorealistic generation of
complex videos from a text description. However, most existing models lack fine-grained …

Compositional 3d-aware video generation with llm director

H Zhu, T He, A Tang, J Guo, Z Chen, J Bian - arxiv preprint arxiv …, 2024 - arxiv.org
Significant progress has been made in text-to-video generation through the use of powerful
generative models and large-scale internet data. However, substantial challenges remain in …

Dreamscene4d: Dynamic multi-object scene generation from monocular videos

WH Chu, L Ke, K Fragkiadaki - arxiv preprint arxiv:2405.02280, 2024 - arxiv.org
Existing VLMs can track in-the-wild 2D video objects while current generative models
provide powerful visual priors for synthesizing novel views for the highly under-constrained …

Animate3d: Animating any 3d model with multi-view video diffusion

Y Jiang, C Yu, C Cao, F Wang, W Hu, J Gao - arxiv preprint arxiv …, 2024 - arxiv.org
Recent advances in 4D generation mainly focus on generating 4D content by distilling pre-
trained text or single-view image-conditioned models. It is inconvenient for them to take …

Avatargo: Zero-shot 4d human-object interaction generation and animation

Y Cao, L Pan, K Han, KYK Wong, Z Liu - arxiv preprint arxiv:2410.07164, 2024 - arxiv.org
Recent advancements in diffusion models have led to significant improvements in the
generation and animation of 4D full-body human-object interactions (HOI). Nevertheless …

4dynamic: Text-to-4d generation with hybrid priors

YJ Yuan, L Kobbelt, J Liu, Y Zhang, P Wan… - arxiv preprint arxiv …, 2024 - arxiv.org
Due to the fascinating generative performance of text-to-image diffusion models, growing
text-to-3D generation works explore distilling the 2D generative priors into 3D, using the …

AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers

S Bahmani, I Skorokhodov, G Qian, A Siarohin… - arxiv preprint arxiv …, 2024 - arxiv.org
Numerous works have recently integrated 3D camera control into foundational text-to-video
models, but the resulting camera control is often imprecise, and video generation quality …

Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis

B Zeng, L Yang, S Li, J Liu, Z Zhang, J Tian… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent advances in diffusion models have demonstrated exceptional capabilities in image
and video generation, further improving the effectiveness of 4D synthesis. Existing 4D …