Vd3d: Taming large video diffusion transformers for 3d camera control

S Bahmani, I Skorokhodov, A Siarohin… - arxiv preprint arxiv …, 2024 - arxiv.org
Modern text-to-video synthesis models demonstrate coherent, photorealistic generation of
complex videos from a text description. However, most existing models lack fine-grained …

Gasp: Gaussian splatting for physic-based simulations

P Borycki, W Smolak, J Waczyńska, M Mazur… - arxiv preprint arxiv …, 2024 - arxiv.org
Physics simulation is paramount for modeling and utilization of 3D scenes in various real-
world applications. However, its integration with state-of-the-art 3D scene rendering …

Grounding Creativity in Physics: A Brief Survey of Physical Priors in AIGC

S Meng, Y Luo, P Liu - arxiv preprint arxiv:2502.07007, 2025 - arxiv.org
Recent advancements in AI-generated content have significantly improved the realism of 3D
and 4D generation. However, most existing methods prioritize appearance consistency …

PhysMotion: Physics-Grounded Dynamics From a Single Image

X Tan, Y Jiang, X Li, Z Zong, T **e, Y Yang… - arxiv preprint arxiv …, 2024 - arxiv.org
We introduce PhysMotion, a novel framework that leverages principled physics-based
simulations to guide intermediate 3D representations generated from a single image and …

AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers

S Bahmani, I Skorokhodov, G Qian, A Siarohin… - arxiv preprint arxiv …, 2024 - arxiv.org
Numerous works have recently integrated 3D camera control into foundational text-to-video
models, but the resulting camera control is often imprecise, and video generation quality …

OmniPhysGS: 3D Constitutive Gaussians for General Physics-Based Dynamics Generation

Y Lin, C Lin, J Xu, Y Mu - arxiv preprint arxiv:2501.18982, 2025 - arxiv.org
Recently, significant advancements have been made in the reconstruction and generation of
3D assets, including static cases and those with physical interactions. To recover the …

Phys4DGen: A Physics-Driven Framework for Controllable and Efficient 4D Content Generation from a Single Image

J Lin, Z Wang, S Jiang, Y Hou, M Jiang - arxiv preprint arxiv:2411.16800, 2024 - arxiv.org
The task of 4D content generation involves creating dynamic 3D models that evolve over
time in response to specific input conditions, such as images. Existing methods rely heavily …

Automated 3D Physical Simulation of Open-world Scene with Gaussian Splatting

H Zhao, H Wang, X Zhao, H Wang, Z Wu… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent advancements in 3D generation models have opened new possibilities for
simulating dynamic 3D object movements and customizing behaviors, yet creating this …

InteRecon: Towards Reconstructing Interactivity of Personal Memorable Items in Mixed Reality

Z Li, J Li, Z **ong, S Zhang, F Faruqi, S Mueller… - arxiv preprint arxiv …, 2025 - arxiv.org
Digital capturing of memorable personal items is a key way to archive personal memories.
Although current digitization methods (eg, photos, videos, 3D scanning) can replicate the …

Gaussians-to-Life: Text-Driven Animation of 3D Gaussian Splatting Scenes

T Wimmer, M Oechsle, M Niemeyer… - arxiv preprint arxiv …, 2024 - arxiv.org
State-of-the-art novel view synthesis methods achieve impressive results for multi-view
captures of static 3D scenes. However, the reconstructed scenes still lack" liveliness," a key …