- Academic Search

Uložit Citovat Počet citací tohoto článku: 1558 Související články Všechny verze (počet: 6)

Diffusion models: A comprehensive survey of methods and applications

L Yang, Z Zhang, Y Song, S Hong, R Xu, Y Zhao… - ACM Computing …, 2023 - dl.acm.org

Diffusion models have emerged as a powerful new family of deep generative models with
record-breaking performance in many applications, including image synthesis, video …

Uložit Citovat Počet citací tohoto článku: 934 Související články Všechny verze (počet: 6) Zobrazit jako HTML

Align your latents: High-resolution video synthesis with latent diffusion models

A Blattmann, R Rombach, H Ling… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract Latent Diffusion Models (LDMs) enable high-quality image synthesis while avoiding
excessive compute demands by training a diffusion model in a compressed lower …

Uložit Citovat Počet citací tohoto článku: 631 Související články Všechny verze (počet: 3) Zobrazit jako HTML

Animatediff: Animate your personalized text-to-image diffusion models without specific tuning

Y Guo, C Yang, A Rao, Z Liang, Y Wang, Y Qiao… - arxiv preprint arxiv …, 2023 - arxiv.org

With the advance of text-to-image (T2I) diffusion models (eg, Stable Diffusion) and
corresponding personalization techniques such as DreamBooth and LoRA, everyone can …

Uložit Citovat Počet citací tohoto článku: 479 Související články Všechny verze (počet: 7) Zobrazit jako HTML

Text2video-zero: Text-to-image diffusion models are zero-shot video generators

L Khachatryan, A Movsisyan… - Proceedings of the …, 2023 - openaccess.thecvf.com

Recent text-to-video generation approaches rely on computationally heavy training and
require large-scale video datasets. In this paper, we introduce a new task, zero-shot text-to …

Uložit Citovat Počet citací tohoto článku: 1596 Související články Všechny verze (počet: 4) Zobrazit jako HTML

Sdxl: Improving latent diffusion models for high-resolution image synthesis

D Podell, Z English, K Lacey, A Blattmann… - arxiv preprint arxiv …, 2023 - arxiv.org

We present SDXL, a latent diffusion model for text-to-image synthesis. Compared to
previous versions of Stable Diffusion, SDXL leverages a three times larger UNet backbone …

Uložit Citovat Počet citací tohoto článku: 487 Související články Všechny verze (počet: 5) Zobrazit jako HTML

Structure and content-guided video synthesis with diffusion models

P Esser, J Chiu, P Atighehchian… - Proceedings of the …, 2023 - openaccess.thecvf.com

Text-guided generative diffusion models unlock powerful image creation and editing tools.
Recent approaches that edit the content of footage while retaining structure require …

Uložit Citovat Počet citací tohoto článku: 719 Související články Všechny verze (počet: 4) Zobrazit jako HTML

Tune-a-video: One-shot tuning of image diffusion models for text-to-video generation

JZ Wu, Y Ge, X Wang, SW Lei, Y Gu… - Proceedings of the …, 2023 - openaccess.thecvf.com

To replicate the success of text-to-image (T2I) generation, recent works employ large-scale
video datasets to train a text-to-video (T2V) generator. Despite their promising results, such …

Uložit Citovat Počet citací tohoto článku: 303 Související články Všechny verze (počet: 6) Zobrazit jako HTML

Fatezero: Fusing attentions for zero-shot text-based video editing

C Qi, X Cun, Y Zhang, C Lei, X Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com

The diffusion-based generative models have achieved remarkable success in text-based
image generation. However, since it contains enormous randomness in generation …