Meshanything v2: Artist-created mesh generation with adjacent mesh tokenization

Y Chen, Y Wang, Y Luo, Z Wang, Z Chen, J Zhu… - arxiv preprint arxiv …, 2024 - arxiv.org
We introduce MeshAnything V2, an autoregressive transformer that generates Artist-Created
Meshes (AM) aligned to given shapes. It can be integrated with various 3D asset production …

Cofie: Learning compact neural surface representations with coordinate fields

H Jiang, H Yang, G Pavlakos… - Advances in Neural …, 2025 - proceedings.neurips.cc
This paper introduces CoFie, a novel local geometry-aware neural surface representation.
CoFie is motivated by the theoretical analysis of local SDFs with quadratic approximation …

3dtopia-xl: Scaling high-quality 3d asset generation via primitive diffusion

Z Chen, J Tang, Y Dong, Z Cao, F Hong, Y Lan… - arxiv preprint arxiv …, 2024 - arxiv.org
The increasing demand for high-quality 3D assets across various industries necessitates
efficient and automated 3D content creation. Despite recent advancements in 3D generative …

An object is worth 64x64 pixels: Generating 3d object via image diffusion

X Yan, HH Lee, Z Wan, AX Chang - arxiv preprint arxiv:2408.03178, 2024 - arxiv.org
We introduce a new approach for generating realistic 3D models with UV maps through a
representation termed" Object Images." This approach encapsulates surface geometry …

G3pt: Unleash the power of autoregressive modeling in 3d generation via cross-scale querying transformer

J Zhang, F **ong, M Xu - arxiv preprint arxiv:2409.06322, 2024 - arxiv.org
Autoregressive transformers have revolutionized generative models in language processing
and shown substantial promise in image and video generation. However, these models face …

Flexitex: Enhancing texture generation with visual guidance

DD Jiang, X Yang, Z Zhao, S Zhang, J Yu, Z Lai… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent texture generation methods achieve impressive results due to the powerful
generative prior they leverage from large-scale text-to-image diffusion models. However …

LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models

Z Wang, J Lorraine, Y Wang, H Su, J Zhu… - arxiv preprint arxiv …, 2024 - arxiv.org
This work explores expanding the capabilities of large language models (LLMs) pretrained
on text to generate 3D meshes within a unified model. This offers key advantages of (1) …

SAR3D: Autoregressive 3D object generation and understanding via multi-scale 3D VQVAE

Y Chen, Y Lan, S Zhou, T Wang, XI Pan - arxiv preprint arxiv:2411.16856, 2024 - arxiv.org
Autoregressive models have demonstrated remarkable success across various fields, from
large language models (LLMs) to large multimodal models (LMMs) and 2D content …

GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation

Y Lan, S Zhou, Z Lyu, F Hong, S Yang, B Dai… - arxiv preprint arxiv …, 2024 - arxiv.org
While 3D content generation has advanced significantly, existing methods still face
challenges with input formats, latent space design, and output representations. This paper …

PhyCAGE: Physically Plausible Compositional 3D Asset Generation from a Single Image

H Yan, M Zhang, Y Li, C Ma, P Ji - arxiv preprint arxiv:2411.18548, 2024 - arxiv.org
We present PhyCAGE, the first approach for physically plausible compositional 3D asset
generation from a single image. Given an input image, we first generate consistent multi …