Freestyle: Free lunch for text-guided style transfer using diffusion models

F He, G Li, M Zhang, L Yan, L Si, F Li… - arxiv preprint arxiv …, 2024 - arxiv.org
The rapid development of generative diffusion models has significantly advanced the field of
style transfer. However, most current style transfer methods based on diffusion models …

Instruct-ipt: All-in-one image processing transformer via weight modulation

Y Tian, J Han, H Chen, Y **, G Zhang, J Hu… - arxiv preprint arxiv …, 2024 - arxiv.org
Due to the unaffordable size and intensive computation costs of low-level vision models, All-
in-One models that are designed to address a handful of low-level vision tasks …

Bigger is not always better: Scaling properties of latent diffusion models

K Mei, Z Tu, M Delbracio, H Talebi… - … on Machine Learning …, 2024 - openreview.net
We study the scaling properties of latent diffusion models (LDMs) with an emphasis on their
sampling efficiency. While improved network architecture and inference algorithms have …

Referring Flexible Image Restoration

R Guan, R Hu, Z Zhou, T Xue, KL Man, J Smith… - arxiv preprint arxiv …, 2024 - arxiv.org
In reality, images often exhibit multiple degradations, such as rain and fog at night (triple
degradations). However, in many cases, individuals may not want to remove all …

TASR: Timestep-Aware Diffusion Model for Image Super-Resolution

Q Lin, X Sun, Y Gao, Y Zhong, D Li, Z Zhao… - arxiv preprint arxiv …, 2024 - arxiv.org
Diffusion models have recently achieved outstanding results in the field of image super-
resolution. These methods typically inject low-resolution (LR) images via ControlNet. In this …

LaSe-E2V: Towards Language-guided Semantic-Aware Event-to-Video Reconstruction

K Chen, H Li, JZ Zhou, Z Wang, L Wang - arxiv preprint arxiv:2407.05547, 2024 - arxiv.org
Event cameras harness advantages such as low latency, high temporal resolution, and high
dynamic range (HDR), compared to standard cameras. Due to the distinct imaging paradigm …