Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Dense text-to-image generation with attention modulation
Existing text-to-image diffusion models struggle to synthesize realistic images given dense
captions, where each text prompt provides a detailed description for a specific image region …
captions, where each text prompt provides a detailed description for a specific image region …
Be yourself: Bounded attention for multi-subject text-to-image generation
Text-to-image diffusion models have an unprecedented ability to generate diverse and high-
quality images. However, they often struggle to faithfully capture the intended semantics of …
quality images. However, they often struggle to faithfully capture the intended semantics of …
Controllable generation with text-to-image diffusion models: A survey
In the rapidly advancing realm of visual generation, diffusion models have revolutionized the
landscape, marking a significant shift in capabilities with their impressive text-guided …
landscape, marking a significant shift in capabilities with their impressive text-guided …
Loco: Locally constrained training-free layout-to-image synthesis
Recent text-to-image diffusion models have reached an unprecedented level in generating
high-quality images. However, their exclusive reliance on textual prompts often falls short in …
high-quality images. However, their exclusive reliance on textual prompts often falls short in …
Multi-modal generative ai: Multi-modal llm, diffusion and beyond
Multi-modal generative AI has received increasing attention in both academia and industry.
Particularly, two dominant families of techniques are: i) The multi-modal large language …
Particularly, two dominant families of techniques are: i) The multi-modal large language …
Personalized residuals for concept-driven text-to-image generation
We present personalized residuals and localized attention-guided sampling for efficient
concept-driven generation using text-to-image diffusion models. Our method first represents …
concept-driven generation using text-to-image diffusion models. Our method first represents …
A survey of multimodal controllable diffusion models
Diffusion models have recently emerged as powerful generative models, producing high-
fidelity samples across domains. Despite this, they have two key challenges, including …
fidelity samples across domains. Despite this, they have two key challenges, including …
Layered rendering diffusion model for zero-shot guided image synthesis
Z Qi, G Huang, Z Huang, Q Guo, J Chen, J Han… - arxiv preprint arxiv …, 2023 - arxiv.org
This paper introduces innovative solutions to enhance spatial controllability in diffusion
models reliant on text queries. We present two key innovations: Vision Guidance and the …
models reliant on text queries. We present two key innovations: Vision Guidance and the …
Object-level Visual Prompts for Compositional Image Generation
We introduce a method for composing object-level visual prompts within a text-to-image
diffusion model. Our approach addresses the task of generating semantically coherent …
diffusion model. Our approach addresses the task of generating semantically coherent …
Lomoe: Localized multi-object editing via multi-diffusion
Recent developments in diffusion models have demonstrated an exceptional capacity to
generate high-quality, prompt-conditioned image edits. Nevertheless, previous approaches …
generate high-quality, prompt-conditioned image edits. Nevertheless, previous approaches …