Linfusion: 1 gpu, 1 minute, 16k image

S Liu, W Yu, Z Tan, X Wang - arxiv preprint arxiv:2409.02097, 2024 - arxiv.org
Modern diffusion models, particularly those utilizing a Transformer-based UNet for
denoising, rely heavily on self-attention operations to manage complex spatial relationships …

Hiprompt: Tuning-free higher-resolution generation with hierarchical mllm prompts

X Liu, Y He, L Guo, X Li, B **, P Li, Y Li… - arxiv preprint arxiv …, 2024 - arxiv.org
The potential for higher-resolution image generation using pretrained diffusion models is
immense, yet these models often struggle with issues of object repetition and structural …

Diffusehigh: Training-free progressive high-resolution image synthesis through structure guidance

Y Kim, G Hwang, J Zhang, E Park - arxiv preprint arxiv:2406.18459, 2024 - arxiv.org
Large-scale generative models, such as text-to-image diffusion models, have garnered
widespread attention across diverse domains due to their creative and high-fidelity image …

Parallel Sequence Modeling via Generalized Spatial Propagation Network

H Wang, W Byeon, J Xu, J Gu, KC Cheung… - arxiv preprint arxiv …, 2025 - arxiv.org
We present the Generalized Spatial Propagation Network (GSPN), a new attention
mechanism optimized for vision tasks that inherently captures 2D spatial structures. Existing …

FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion

H Qiu, S Zhang, Y Wei, R Chu, H Yuan, X Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
Visual diffusion models achieve remarkable progress, yet they are typically trained at limited
resolutions due to the lack of high-resolution data and constrained computation resources …

FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion

H Yang, A Bulat, I Hadji, HX Pham, X Zhu… - arxiv preprint arxiv …, 2024 - arxiv.org
Diffusion models are proficient at generating high-quality images. They are however
effective only when operating at the resolution used during training. Inference at a scaled …

AccDiffusion v2: Towards More Accurate Higher-Resolution Diffusion Extrapolation

Z Lin, M Lin, W Zhan, R Ji - arxiv preprint arxiv:2412.02099, 2024 - arxiv.org
Diffusion models suffer severe object repetition and local distortion when the inference
resolution differs from its pre-trained resolution. We propose AccDiffusion v2, an accurate …