SinSR: diffusion-based image super-resolution in a single step

Y Wang, W Yang, X Chen, Y Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
While super-resolution (SR) methods based on diffusion models exhibit promising results
their practical application is hindered by the substantial number of required inference steps …

Hiprompt: Tuning-free higher-resolution generation with hierarchical mllm prompts

X Liu, Y He, L Guo, X Li, B **, P Li, Y Li… - arxiv preprint arxiv …, 2024 - arxiv.org
The potential for higher-resolution image generation using pretrained diffusion models is
immense, yet these models often struggle with issues of object repetition and structural …

Videodpo: Omni-preference alignment for video diffusion generation

R Liu, H Wu, Z Ziqiang, C Wei, Y He, R Pi… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent progress in generative diffusion models has greatly advanced text-to-video
generation. While text-to-video models trained on large-scale, diverse datasets can produce …

LLMs Meet Multimodal Generation and Editing: A Survey

Y He, Z Liu, J Chen, Z Tian, H Liu, X Chi, R Liu… - arxiv preprint arxiv …, 2024 - arxiv.org
With the recent advancement in large language models (LLMs), there is a growing interest in
combining LLMs with multimodal learning. Previous surveys of multimodal large language …

Diffusehigh: Training-free progressive high-resolution image synthesis through structure guidance

Y Kim, G Hwang, J Zhang, E Park - arxiv preprint arxiv:2406.18459, 2024 - arxiv.org
Large-scale generative models, such as text-to-image diffusion models, have garnered
widespread attention across diverse domains due to their creative and high-fidelity image …

Megafusion: Extend diffusion models towards higher-resolution image generation without further tuning

H Wu, S Shen, Q Hu, X Zhang, Y Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org
Diffusion models have emerged as frontrunners in text-to-image generation, however, their
fixed image resolution during training often leads to challenges in high-resolution image …

Flame diffuser: Wildfire image synthesis using mask guided diffusion

H Wang, SPH Boroujeni, X Chen… - … Conference on Big …, 2024 - ieeexplore.ieee.org
Wildfires are a significant threat to ecosystems and human infrastructure, leading to
widespread destruction and environmental degradation. Recent advancements in deep …

FLAME Diffuser: Wildfire Image Synthesis using Mask Guided Diffusion

H Wang, SPH Boroujeni, X Chen, A Bastola… - arxiv preprint arxiv …, 2024 - arxiv.org
Wildfires are a significant threat to ecosystems and human infrastructure, leading to
widespread destruction and environmental degradation. Recent advancements in deep …

[PDF][PDF] Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models

A Tragakis, M Aversa, C Kaul… - arxiv preprint arxiv …, 2024 - physics.gla.ac.uk
In this work, we introduce Pixelsmith, a zero-shot text-to-image generative framework to
sample images at higher resolutions with a single GPU. We are the first to show that it is …

TDDSR: Single-Step Diffusion with Two Discriminators for Super Resolution

S Kim, TK Kim - arxiv preprint arxiv:2410.07663, 2024 - arxiv.org
Super-resolution methods are increasingly being specialized for both real-world and face-
specific tasks. However, many existing approaches rely on simplistic degradation models …