Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Emu3: Next-token prediction is all you need
While next-token prediction is considered a promising path towards artificial general
intelligence, it has struggled to excel in multimodal tasks, which are still dominated by …
intelligence, it has struggled to excel in multimodal tasks, which are still dominated by …
Lumina-mgpt: Illuminate flexible photorealistic text-to-image generation with multimodal generative pretraining
We present Lumina-mGPT, a family of multimodal autoregressive models capable of various
vision and language tasks, particularly excelling in generating flexible photorealistic images …
vision and language tasks, particularly excelling in generating flexible photorealistic images …
Unidream: Unifying diffusion priors for relightable text-to-3d generation
Recent advancements in text-to-3D generation technology have significantly advanced the
conversion of textual descriptions into imaginative well-geometrical and finely textured 3D …
conversion of textual descriptions into imaginative well-geometrical and finely textured 3D …
Sana: Efficient high-resolution image synthesis with linear diffusion transformers
We introduce Sana, a text-to-image framework that can efficiently generate images up to
4096$\times $4096 resolution. Sana can synthesize high-resolution, high-quality images …
4096$\times $4096 resolution. Sana can synthesize high-resolution, high-quality images …
Janus-pro: Unified multimodal understanding and generation with data and model scaling
In this work, we introduce Janus-Pro, an advanced version of the previous work Janus.
Specifically, Janus-Pro incorporates (1) an optimized training strategy,(2) expanded training …
Specifically, Janus-Pro incorporates (1) an optimized training strategy,(2) expanded training …
PixWizard: Versatile image-to-image visual assistant with open-language instructions
This paper presents a versatile image-to-image visual assistant, PixWizard, designed for
image generation, manipulation, and translation based on free-from language instructions …
image generation, manipulation, and translation based on free-from language instructions …
Janusflow: Harmonizing autoregression and rectified flow for unified multimodal understanding and generation
We present JanusFlow, a powerful framework that unifies image understanding and
generation in a single model. JanusFlow introduces a minimalist architecture that integrates …
generation in a single model. JanusFlow introduces a minimalist architecture that integrates …
Customize your visual autoregressive recipe with set autoregressive modeling
We introduce a new paradigm for AutoRegressive (AR) image generation, termed Set
AutoRegressive Modeling (SAR). SAR generalizes the conventional AR to the next-set …
AutoRegressive Modeling (SAR). SAR generalizes the conventional AR to the next-set …
SANA: Efficient High-Resolution Text-to-Image Synthesis with Linear Diffusion Transformers
We introduce Sana, a text-to-image framework that can efficiently generate images up to
4096$\times $4096 resolution. Sana can synthesize high-resolution, high-quality images …
4096$\times $4096 resolution. Sana can synthesize high-resolution, high-quality images …
SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training
Existing text-to-image (T2I) diffusion models face several limitations, including large model
sizes, slow runtime, and low-quality generation on mobile devices. This paper aims to …
sizes, slow runtime, and low-quality generation on mobile devices. This paper aims to …