- Academic Search

Y Wang, W Yang, X Chen, Y Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com

While super-resolution (SR) methods based on diffusion models exhibit promising results
their practical application is hindered by the substantial number of required inference steps …

Save Cite Cited by 51 Related articles All 3 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Hiprompt: Tuning-free higher-resolution generation with hierarchical mllm prompts

X Liu, Y He, L Guo, X Li, B **, P Li, Y Li… - arxiv preprint arxiv …, 2024 - arxiv.org

The potential for higher-resolution image generation using pretrained diffusion models is
immense, yet these models often struggle with issues of object repetition and structural …

Save Cite Cited by 3 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Videodpo: Omni-preference alignment for video diffusion generation

R Liu, H Wu, Z Ziqiang, C Wei, Y He, R Pi… - arxiv preprint arxiv …, 2024 - arxiv.org

Recent progress in generative diffusion models has greatly advanced text-to-video
generation. While text-to-video models trained on large-scale, diverse datasets can produce …

Save Cite Cited by 1 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

LLMs Meet Multimodal Generation and Editing: A Survey

Y He, Z Liu, J Chen, Z Tian, H Liu, X Chi, R Liu… - arxiv preprint arxiv …, 2024 - arxiv.org

With the recent advancement in large language models (LLMs), there is a growing interest in
combining LLMs with multimodal learning. Previous surveys of multimodal large language …

Save Cite Cited by 14 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Diffusehigh: Training-free progressive high-resolution image synthesis through structure guidance

Y Kim, G Hwang, J Zhang, E Park - arxiv preprint arxiv:2406.18459, 2024 - arxiv.org

Large-scale generative models, such as text-to-image diffusion models, have garnered
widespread attention across diverse domains due to their creative and high-fidelity image …

Save Cite Cited by 3 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Megafusion: Extend diffusion models towards higher-resolution image generation without further tuning

H Wu, S Shen, Q Hu, X Zhang, Y Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org

Diffusion models have emerged as frontrunners in text-to-image generation, however, their
fixed image resolution during training often leads to challenges in high-resolution image …

Save Cite Cited by 2 Related articles All 5 versions Free GPT-4 View as HTML

Flame diffuser: Wildfire image synthesis using mask guided diffusion

H Wang, SPH Boroujeni, X Chen… - … Conference on Big …, 2024 - ieeexplore.ieee.org

Wildfires are a significant threat to ecosystems and human infrastructure, leading to
widespread destruction and environmental degradation. Recent advancements in deep …

Save Cite Cited by 2 Related articles All 2 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

FLAME Diffuser: Wildfire Image Synthesis using Mask Guided Diffusion

H Wang, SPH Boroujeni, X Chen, A Bastola… - arxiv preprint arxiv …, 2024 - arxiv.org

Wildfires are a significant threat to ecosystems and human infrastructure, leading to
widespread destruction and environmental degradation. Recent advancements in deep …

Save Cite Cited by 1 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] gla.ac.uk

[PDF][PDF] Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models

A Tragakis, M Aversa, C Kaul… - arxiv preprint arxiv …, 2024 - physics.gla.ac.uk

In this work, we introduce Pixelsmith, a zero-shot text-to-image generative framework to
sample images at higher resolutions with a single GPU. We are the first to show that it is …

Save Cite Cited by 1 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

TDDSR: Single-Step Diffusion with Two Discriminators for Super Resolution

S Kim, TK Kim - arxiv preprint arxiv:2410.07663, 2024 - arxiv.org

Super-resolution methods are increasingly being specialized for both real-world and face-
specific tasks. However, many existing approaches rely on simplistic degradation models …

Save Cite Cited by 1 Related articles All 2 versions Free GPT-4 View as HTML

Create alert

Cite

Advanced search

Saved to My library

Make a cheap scaling: A self-cascade diffusion model for higher-resolution adaptation

SinSR: diffusion-based image super-resolution in a single step

Hiprompt: Tuning-free higher-resolution generation with hierarchical mllm prompts

Videodpo: Omni-preference alignment for video diffusion generation

LLMs Meet Multimodal Generation and Editing: A Survey

Diffusehigh: Training-free progressive high-resolution image synthesis through structure guidance

Megafusion: Extend diffusion models towards higher-resolution image generation without further tuning

Flame diffuser: Wildfire image synthesis using mask guided diffusion

FLAME Diffuser: Wildfire Image Synthesis using Mask Guided Diffusion

[PDF][PDF] Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models

TDDSR: Single-Step Diffusion with Two Discriminators for Super Resolution