Linfusion: 1 gpu, 1 minute, 16k image
Modern diffusion models, particularly those utilizing a Transformer-based UNet for
denoising, rely heavily on self-attention operations to manage complex spatial relationships …
denoising, rely heavily on self-attention operations to manage complex spatial relationships …
Hiprompt: Tuning-free higher-resolution generation with hierarchical mllm prompts
The potential for higher-resolution image generation using pretrained diffusion models is
immense, yet these models often struggle with issues of object repetition and structural …
immense, yet these models often struggle with issues of object repetition and structural …
Diffusehigh: Training-free progressive high-resolution image synthesis through structure guidance
Large-scale generative models, such as text-to-image diffusion models, have garnered
widespread attention across diverse domains due to their creative and high-fidelity image …
widespread attention across diverse domains due to their creative and high-fidelity image …
Parallel Sequence Modeling via Generalized Spatial Propagation Network
We present the Generalized Spatial Propagation Network (GSPN), a new attention
mechanism optimized for vision tasks that inherently captures 2D spatial structures. Existing …
mechanism optimized for vision tasks that inherently captures 2D spatial structures. Existing …
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion
Visual diffusion models achieve remarkable progress, yet they are typically trained at limited
resolutions due to the lack of high-resolution data and constrained computation resources …
resolutions due to the lack of high-resolution data and constrained computation resources …
FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion
Diffusion models are proficient at generating high-quality images. They are however
effective only when operating at the resolution used during training. Inference at a scaled …
effective only when operating at the resolution used during training. Inference at a scaled …
AccDiffusion v2: Towards More Accurate Higher-Resolution Diffusion Extrapolation
Diffusion models suffer severe object repetition and local distortion when the inference
resolution differs from its pre-trained resolution. We propose AccDiffusion v2, an accurate …
resolution differs from its pre-trained resolution. We propose AccDiffusion v2, an accurate …