Graphdreamer: Compositional 3d scene synthesis from scene graphs

G Gao, W Liu, A Chen, A Geiger… - Proceedings of the …, 2024 - openaccess.thecvf.com
As pretrained text-to-image diffusion models become increasingly powerful recent efforts
have been made to distill knowledge from these text-to-image pretrained models for …

Anyhome: Open-vocabulary generation of structured and textured 3d homes

R Fu, Z Wen, Z Liu, S Sridhar - European Conference on Computer Vision, 2024 - Springer
Inspired by cognitive theories, we introduce AnyHome, a framework that translates any text
into well-structured and textured indoor scenes at a house-scale. By prompting Large …

Efficient diffusion models: A comprehensive survey from principles to practices

Z Ma, Y Zhang, G Jia, L Zhao, Y Ma, M Ma… - arxiv preprint arxiv …, 2024 - arxiv.org
As one of the most popular and sought-after generative models in the recent years, diffusion
models have sparked the interests of many researchers and steadily shown excellent …

Sg-bot: Object rearrangement via coarse-to-fine robotic imagination on scene graphs

G Zhai, X Cai, D Huang, Y Di… - … on Robotics and …, 2024 - ieeexplore.ieee.org
Object rearrangement is pivotal in robotic-environment interactions, representing a
significant capability in embodied AI. In this paper, we present SG-Bot, a novel …

Physcene: Physically interactable 3d scene synthesis for embodied ai

Y Yang, B Jia, P Zhi, S Huang - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
With recent developments in Embodied Artificial Intelligence (EAI) research there has been
a growing demand for high-quality large-scale interactive scene generation. While prior …

Secondpose: Se (3)-consistent dual-stream feature fusion for category-level pose estimation

Y Chen, Y Di, G Zhai, F Manhardt… - Proceedings of the …, 2024 - openaccess.thecvf.com
Category-level object pose estimation aiming to predict the 6D pose and 3D size of objects
from known categories typically struggles with large intra-class shape variation. Existing …

Diffuscene: Denoising diffusion models for generative indoor scene synthesis

J Tang, Y Nie, L Markhasin, A Dai… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present DiffuScene for indoor 3D scene synthesis based on a novel scene configuration
denoising diffusion model. It generates 3D instance properties stored in an unordered object …

Echoscene: Indoor scene generation via information echo over scene graph diffusion

G Zhai, EP Örnek, DZ Chen, R Liao, Y Di… - … on Computer Vision, 2024 - Springer
We present EchoScene, an interactive and controllable generative model that generates 3D
indoor scenes on scene graphs. EchoScene leverages a dual-branch diffusion model that …

Frankenstein: Generating semantic-compositional 3d scenes in one tri-plane

H Yan, Y Li, Z Wu, S Chen, W Sun, T Shang… - SIGGRAPH Asia 2024 …, 2024 - dl.acm.org
We present Frankenstein, a diffusion-based framework that can generate semantic-
compositional 3D scenes in a single pass. Unlike existing methods that output a single …

Graph foundation models

H Mao, Z Chen, W Tang, J Zhao, Y Ma, T Zhao… - arxiv preprint arxiv …, 2024 - arxiv.org
Graph Foundation Model (GFM) is a new trending research topic in the graph domain,
aiming to develop a graph model capable of generalizing across different graphs and tasks …