A survey on video diffusion models

Z Xing, Q Feng, H Chen, Q Dai, H Hu, H Xu… - ACM Computing …, 2024 - dl.acm.org
The recent wave of AI-generated content (AIGC) has witnessed substantial success in
computer vision, with the diffusion model playing a crucial role in this achievement. Due to …

Sora: A review on background, technology, limitations, and opportunities of large vision models

Y Liu, K Zhang, Y Li, Z Yan, C Gao, R Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Sora is a text-to-video generative AI model, released by OpenAI in February 2024. The
model is trained to generate videos of realistic or imaginative scenes from text instructions …

LaVie: High-quality video generation with cascaded latent diffusion models

Y Wang, X Chen, X Ma, S Zhou, Z Huang… - International Journal of …, 2024 - Springer
This work aims to learn a high-quality text-to-video (T2V) generative model by leveraging a
pre-trained text-to-image (T2I) model as a basis. It is a highly desirable yet challenging task …

SV3D: Novel multi-view synthesis and 3D generation from a single image using latent video diffusion

V Voleti, CH Yao, M Boss, A Letts, D Pankratz… - … on Computer Vision, 2024 - Springer
We present Stable Video 3D (SV3D), a latent video diffusion model for high-resolution,
image-to-multi-view generation of orbital videos around a 3D object. Recent …

DynamiCrafter: Animating open-domain images with video diffusion priors

J Xing, M Xia, Y Zhang, H Chen, W Yu, H Liu… - … on Computer Vision, 2024 - Springer
Animating a still image offers an engaging visual experience. Traditional image animation
techniques mainly focus on animating natural scenes with stochastic dynamics (e.g., clouds …

Photorealistic video generation with diffusion models

A Gupta, L Yu, K Sohn, X Gu, M Hahn, FF Li… - … on Computer Vision, 2024 - Springer
We present WALT, a diffusion transformer for photorealistic video generation from text
prompts. Our approach has two key design decisions. First, we use a causal encoder to …

Lumiere: A space-time diffusion model for video generation

O Bar-Tal, H Chefer, O Tov, C Herrmann… - SIGGRAPH Asia 2024 …, 2024 - dl.acm.org
We introduce Lumiere, a text-to-video diffusion model designed for synthesizing videos that
portray realistic, diverse and coherent motion, a pivotal challenge in video synthesis. To this …

VideoPoet: A large language model for zero-shot video generation

D Kondratyuk, L Yu, X Gu, J Lezama, J Huang… - arXiv preprint arXiv …, 2023 - arxiv.org
We present VideoPoet, a language model capable of synthesizing high-quality video, with
matching audio, from a large variety of conditioning signals. VideoPoet employs a decoder …

Multimodal foundation models: From specialists to general-purpose assistants

C Li, Z Gan, Z Yang, J Yang, L Li… - … and Trends® in …, 2024 - nowpublishers.com
This monograph presents a comprehensive survey of multimodal foundation models that
demonstrate vision and vision-language capabilities, focusing on the transition from specialist …

Fast high-resolution image synthesis with latent adversarial diffusion distillation

A Sauer, F Boesel, T Dockhorn, A Blattmann… - SIGGRAPH Asia 2024 …, 2024 - dl.acm.org
Diffusion models are the main driver of progress in image and video synthesis, but suffer
from slow inference speed. Distillation methods, like the recently introduced adversarial …