X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models

Z Sun, Z Chu, P Zhang, T Wu, X Dong, Y Zang… - arxiv preprint arxiv …, 2024 - arxiv.org
In-context generation is a key component of large language models'(LLMs) open-task
generalization capability. By leveraging a few examples as context, LLMs can perform both …

Sampart3d: Segment any part in 3d objects

Y Yang, Y Huang, YC Guo, L Lu, X Wu, EY Lam… - arxiv preprint arxiv …, 2024 - arxiv.org
3D part segmentation is a crucial and challenging task in 3D perception, playing a vital role
in applications such as robotics, 3D generation, and 3D editing. Recent methods harness …

StdGEN: Semantic-Decomposed 3D Character Generation from Single Images

Y He, Y Zhou, W Zhao, Z Wu, K **ao, W Yang… - arxiv preprint arxiv …, 2024 - arxiv.org
We present StdGEN, an innovative pipeline for generating semantically decomposed high-
quality 3D characters from single images, enabling broad applications in virtual reality …

PrEditor3D: Fast and Precise 3D Shape Editing

Z Erkoç, C Gümeli, C Wang, M Nießner, A Dai… - arxiv preprint arxiv …, 2024 - arxiv.org
We propose a training-free approach to 3D editing that enables the editing of a single shape
within a few minutes. The edited 3D mesh aligns well with the prompts, and remains …

Bootstrap3D: Improving Multi-view Diffusion Model with Synthetic Data

Z Sun, T Wu, P Zhang, Y Zang, X Dong, Y **ong, D Lin… - openreview.net
Recent years have witnessed remarkable progress in multi-view diffusion models for 3D
content creation. However, there remains a significant gap in image quality and prompt …