One-2-3-45++: Fast single image to 3d objects with consistent multi-view generation and 3d diffusion

M Liu, R Shi, L Chen, Z Zhang, C Xu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recent advancements in open-world 3D object generation have been remarkable with
image-to-3D methods offering superior fine-grained control over their text-to-3D …

Gaussiandreamer: Fast generation from text to 3d gaussians by bridging 2d and 3d diffusion models

T Yi, J Fang, J Wang, G Wu, L Xie… - Proceedings of the …, 2024 - openaccess.thecvf.com
In recent times the generation of 3D assets from text prompts has shown impressive results.
Both 2D and 3D diffusion models can help generate decent 3D objects based on prompts …

4d-fy: Text-to-4d generation using hybrid score distillation sampling

S Bahmani, I Skorokhodov, V Rong… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recent breakthroughs in text-to-4D generation rely on pre-trained text-to-image and text-to-
video models to generate dynamic 3D scenes. However current text-to-4D methods face a …

Zero123++: a single image to consistent multi-view diffusion base model

R Shi, H Chen, Z Zhang, M Liu, C Xu, X Wei… - arXiv preprint arXiv …, 2023 - arxiv.org
We report Zero123++, an image-conditioned diffusion model for generating 3D-consistent
multi-view images from a single input view. To take full advantage of pretrained 2D …

Tc4d: Trajectory-conditioned text-to-4d generation

S Bahmani, X Liu, W Yifan, I Skorokhodov… - … on Computer Vision, 2024 - Springer
Recent techniques for text-to-4D generation synthesize dynamic 3D scenes using
supervision from pre-trained text-to-video models. However, existing representations, such …

X-portrait: Expressive portrait animation with hierarchical motion attention

Y Xie, H Xu, G Song, C Wang, Y Shi, L Luo - ACM SIGGRAPH 2024 …, 2024 - dl.acm.org
We propose X-Portrait, an innovative conditional diffusion model tailored for generating
expressive and temporally coherent portrait animation. Specifically, given a single portrait as …

Sc4d: Sparse-controlled video-to-4d generation and motion transfer

Z Wu, C Yu, Y Jiang, C Cao, F Wang, X Bai - European Conference on …, 2024 - Springer
Recent advances in 2D/3D generative models enable the generation of dynamic 3D objects
from a single-view video. Existing approaches utilize score distillation sampling to form the …

Vd3d: Taming large video diffusion transformers for 3d camera control

S Bahmani, I Skorokhodov, A Siarohin… - arXiv preprint arXiv …, 2024 - arxiv.org
Modern text-to-video synthesis models demonstrate coherent, photorealistic generation of
complex videos from a text description. However, most existing models lack fine-grained …

Image sculpting: Precise object editing with 3d geometry control

J Yenphraphai, X Pan, S Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present Image Sculpting, a new framework for editing 2D images by
incorporating tools from 3D geometry and graphics. This approach differs markedly from …

Magicpose: Realistic human poses and facial expressions retargeting with identity-aware diffusion

D Chang, Y Shi, Q Gao, H Xu, J Fu, G Song… - … on Machine Learning, 2023 - openreview.net
In this work, we propose MagicPose, a diffusion-based model for 2D human pose and facial
expression retargeting. Specifically, given a reference image, we aim to generate a person's …