Advancing 3D point cloud understanding through deep transfer learning: A comprehensive survey
The 3D point cloud (3DPC) has significantly evolved and benefited from the advance of
deep learning (DL). However, the latter faces various issues, including the lack of data or …
deep learning (DL). However, the latter faces various issues, including the lack of data or …
Lt3sd: Latent trees for 3d scene diffusion
Layout-your-3D: Controllable and Precise 3D Generation with 2D Blueprint
We present Layout-Your-3D, a framework that allows controllable and compositional 3D
generation from text prompts. Existing text-to-3D methods often struggle to generate assets …
generation from text prompts. Existing text-to-3D methods often struggle to generate assets …
InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models
We present InfiniCube, a scalable method for generating unbounded dynamic 3D driving
scenes with high fidelity and controllability. Previous methods for scene generation either …
scenes with high fidelity and controllability. Previous methods for scene generation either …
OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving
Understanding the evolution of 3D scenes is important for effective autonomous driving.
While conventional methods mode scene development with the motion of individual …
While conventional methods mode scene development with the motion of individual …
MSF: Efficient Diffusion Model Via Multi-Scale Latent Factorize
H Xu, L Chen, S Ding, Y Gao, D Jiang, Y Li… - arxiv preprint arxiv …, 2025 - arxiv.org
Diffusion-based generative models have achieved remarkable progress in visual content
generation. However, traditional diffusion models directly denoise the entire image from …
generation. However, traditional diffusion models directly denoise the entire image from …
SSEditor: Controllable Mask-to-Scene Generation with Diffusion Model
H Zheng, Y Liang - arxiv preprint arxiv:2411.12290, 2024 - arxiv.org
Recent advancements in 3D diffusion-based semantic scene generation have gained
attention. However, existing methods rely on unconditional generation and require multiple …
attention. However, existing methods rely on unconditional generation and require multiple …
Map Imagination Like Blind Humans: Group Diffusion Model for Robotic Map Generation
Q Song, W Bai - arxiv preprint arxiv:2412.16908, 2024 - arxiv.org
Can robots imagine or generate maps like humans do, especially when only limited
information can be perceived like blind people? To address this challenging task, we …
information can be perceived like blind people? To address this challenging task, we …
OccVAR: Scalable 4D Occupancy Prediction via Next-Scale Prediction
In this paper, we propose OCCVAR, a generative occupancy world model that simulates the
movement of the ego vehicle and the evolution of the surrounding environment. Different …
movement of the ego vehicle and the evolution of the surrounding environment. Different …