SinSR: diffusion-based image super-resolution in a single step
While super-resolution (SR) methods based on diffusion models exhibit promising results,
their practical application is hindered by the substantial number of required inference steps …
HiPrompt: Tuning-free higher-resolution generation with hierarchical MLLM prompts
The potential for higher-resolution image generation using pretrained diffusion models is
immense, yet these models often struggle with issues of object repetition and structural …
VideoDPO: Omni-preference alignment for video diffusion generation
Recent progress in generative diffusion models has greatly advanced text-to-video
generation. While text-to-video models trained on large-scale, diverse datasets can produce …
LLMs Meet Multimodal Generation and Editing: A Survey
With the recent advancement in large language models (LLMs), there is a growing interest in
combining LLMs with multimodal learning. Previous surveys of multimodal large language …
DiffuseHigh: Training-free progressive high-resolution image synthesis through structure guidance
Large-scale generative models, such as text-to-image diffusion models, have garnered
widespread attention across diverse domains due to their creative and high-fidelity image …
MegaFusion: Extend diffusion models towards higher-resolution image generation without further tuning
Diffusion models have emerged as frontrunners in text-to-image generation, however, their
fixed image resolution during training often leads to challenges in high-resolution image …
FLAME Diffuser: Wildfire Image Synthesis using Mask Guided Diffusion
Wildfires are a significant threat to ecosystems and human infrastructure, leading to
widespread destruction and environmental degradation. Recent advancements in deep …
Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models
In this work, we introduce Pixelsmith, a zero-shot text-to-image generative framework to
sample images at higher resolutions with a single GPU. We are the first to show that it is …
TDDSR: Single-Step Diffusion with Two Discriminators for Super Resolution
Super-resolution methods are increasingly being specialized for both real-world and
face-specific tasks. However, many existing approaches rely on simplistic degradation models …