- Academic Search

Z **ng, Q Feng, H Chen, Q Dai, H Hu, H Xu… - ACM Computing …, 2024 - dl.acm.org

The recent wave of AI-generated content (AIGC) has witnessed substantial success in
computer vision, with the diffusion model playing a crucial role in this achievement. Due to …

Uložit Citovat Počet citací tohoto článku: 92 Související články Všechny verze (počet: 3)

[Free GPT-4]

[PDF] thecvf.com

Align your latents: High-resolution video synthesis with latent diffusion models

A Blattmann, R Rombach, H Ling… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract Latent Diffusion Models (LDMs) enable high-quality image synthesis while avoiding
excessive compute demands by training a diffusion model in a compressed lower …

Uložit Citovat Počet citací tohoto článku: 934 Související články Všechny verze (počet: 6) Zobrazit jako HTML

[Free GPT-4]

[PDF] thecvf.com

Scaling up gans for text-to-image synthesis

M Kang, JY Zhu, R Zhang, J Park… - Proceedings of the …, 2023 - openaccess.thecvf.com

The recent success of text-to-image synthesis has taken the world by storm and captured the
general public's imagination. From a technical standpoint, it also marked a drastic change in …

Uložit Citovat Počet citací tohoto článku: 534 Související články Všechny verze (počet: 6) Zobrazit jako HTML

[Free GPT-4]

[PDF] arxiv.org

Animatediff: Animate your personalized text-to-image diffusion models without specific tuning

Y Guo, C Yang, A Rao, Z Liang, Y Wang, Y Qiao… - arxiv preprint arxiv …, 2023 - arxiv.org

With the advance of text-to-image (T2I) diffusion models (eg, Stable Diffusion) and
corresponding personalization techniques such as DreamBooth and LoRA, everyone can …

Uložit Citovat Počet citací tohoto článku: 631 Související články Všechny verze (počet: 3) Zobrazit jako HTML

[Free GPT-4]

[PDF] arxiv.org

Adversarial diffusion distillation

A Sauer, D Lorenz, A Blattmann… - European Conference on …, 2024 - Springer

Abstract We introduce Adversarial Diffusion Distillation (ADD), a novel training approach that
efficiently samples large-scale foundational image diffusion models in just 1–4 steps while …

Uložit Citovat Počet citací tohoto článku: 284 Související články Všechny verze (počet: 2)

[Free GPT-4]

[PDF] aaai.org

T2i-adapter: Learning adapters to dig out more controllable ability for text-to-image diffusion models

C Mou, X Wang, L **e, Y Wu, J Zhang, Z Qi… - Proceedings of the AAAI …, 2024 - ojs.aaai.org

The incredible generative ability of large-scale text-to-image (T2I) models has demonstrated
strong power of learning complex structures and meaningful semantics. However, relying …

Uložit Citovat Počet citací tohoto článku: 864 Související články Všechny verze (počet: 3) Zobrazit jako HTML

[Free GPT-4]

[PDF] thecvf.com

Open-vocabulary panoptic segmentation with text-to-image diffusion models

J Xu, S Liu, A Vahdat, W Byeon… - Proceedings of the …, 2023 - openaccess.thecvf.com

We present ODISE: Open-vocabulary DIffusion-based panoptic SEgmentation, which unifies
pre-trained text-image diffusion and discriminative models to perform open-vocabulary …

Uložit Citovat Počet citací tohoto článku: 429 Související články Všechny verze (počet: 6) Zobrazit jako HTML

[Free GPT-4]

[PDF] arxiv.org

Sdxl: Improving latent diffusion models for high-resolution image synthesis

D Podell, Z English, K Lacey, A Blattmann… - arxiv preprint arxiv …, 2023 - arxiv.org

We present SDXL, a latent diffusion model for text-to-image synthesis. Compared to
previous versions of Stable Diffusion, SDXL leverages a three times larger UNet backbone …

Uložit Citovat Počet citací tohoto článku: 1596 Související články Všechny verze (počet: 4) Zobrazit jako HTML

[Free GPT-4]

[PDF] thecvf.com

Structure and content-guided video synthesis with diffusion models

P Esser, J Chiu, P Atighehchian… - Proceedings of the …, 2023 - openaccess.thecvf.com

Text-guided generative diffusion models unlock powerful image creation and editing tools.
Recent approaches that edit the content of footage while retaining structure require …

Uložit Citovat Počet citací tohoto článku: 487 Související články Všechny verze (počet: 5) Zobrazit jako HTML

[Free GPT-4]

[PDF] thecvf.com

Fantasia3d: Disentangling geometry and appearance for high-quality text-to-3d content creation

R Chen, Y Chen, N Jiao, K Jia - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Automatic 3D content creation has achieved rapid progress recently due to the availability of
pre-trained, large language models and image diffusion models, forming the emerging topic …

Uložit Citovat Počet citací tohoto článku: 501 Související články Všechny verze (počet: 5) Zobrazit jako HTML

Citovat

Rozšířené vyhledávání

Uloženo do Mojí knihovny

A survey on video diffusion models

Align your latents: High-resolution video synthesis with latent diffusion models

Scaling up gans for text-to-image synthesis

Animatediff: Animate your personalized text-to-image diffusion models without specific tuning

Adversarial diffusion distillation

T2i-adapter: Learning adapters to dig out more controllable ability for text-to-image diffusion models

Open-vocabulary panoptic segmentation with text-to-image diffusion models

Sdxl: Improving latent diffusion models for high-resolution image synthesis

Structure and content-guided video synthesis with diffusion models

Fantasia3d: Disentangling geometry and appearance for high-quality text-to-3d content creation