- Academic Search

Y Liu, K Zhang, Y Li, Z Yan, C Gao, R Chen… - arxiv preprint arxiv …, 2024 - arxiv.org

Sora is a text-to-video generative AI model, released by OpenAI in February 2024. The
model is trained to generate videos of realistic or imaginative scenes from text instructions …

保存引用被引用数: 222 関連記事全 2 バージョン HTMLバージョン

[Free GPT-4]

[PDF] arxiv.org

State of the art on diffusion models for visual computing

R Po, W Yifan, V Golyanik, K Aberman… - Computer Graphics …, 2024 - Wiley Online Library

The field of visual computing is rapidly advancing due to the emergence of generative
artificial intelligence (AI), which unlocks unprecedented capabilities for the generation …

保存引用被引用数: 91 関連記事全 12 バージョン

[Free GPT-4]

[PDF] arxiv.org

Show-1: Marrying pixel and latent diffusion models for text-to-video generation

DJ Zhang, JZ Wu, JW Liu, R Zhao, L Ran, Y Gu… - International Journal of …, 2024 - Springer

Significant advancements have been achieved in the realm of large-scale pre-trained text-to-
video Diffusion Models (VDMs). However, previous methods either rely solely on pixel …

保存引用被引用数: 167 関連記事全 2 バージョン

[Free GPT-4]

[PDF] arxiv.org

Videopoet: A large language model for zero-shot video generation

D Kondratyuk, L Yu, X Gu, J Lezama, J Huang… - arxiv preprint arxiv …, 2023 - arxiv.org

We present VideoPoet, a language model capable of synthesizing high-quality video, with
matching audio, from a large variety of conditioning signals. VideoPoet employs a decoder …

保存引用被引用数: 176 関連記事全 5 バージョン HTMLバージョン

[Free GPT-4]

[PDF] arxiv.org

Fast high-resolution image synthesis with latent adversarial diffusion distillation

A Sauer, F Boesel, T Dockhorn, A Blattmann… - SIGGRAPH Asia 2024 …, 2024 - dl.acm.org

Diffusion models are the main driver of progress in image and video synthesis, but suffer
from slow inference speed. Distillation methods, like the recently introduced adversarial …

保存引用被引用数: 75 関連記事全 2 バージョン

[Free GPT-4]

[PDF] arxiv.org

Sparsectrl: Adding sparse controls to text-to-video diffusion models

Y Guo, C Yang, A Rao, M Agrawala, D Lin… - European Conference on …, 2024 - Springer

The development of text-to-video (T2V), ie, generating videos with a given text prompt, has
been significantly advanced in recent years. However, relying solely on text prompts often …

保存引用被引用数: 76 関連記事全 2 バージョン

[Free GPT-4]

[PDF] thecvf.com

Vbench: Comprehensive benchmark suite for video generative models

Z Huang, Y He, J Yu, F Zhang, C Si… - Proceedings of the …, 2024 - openaccess.thecvf.com

Video generation has witnessed significant advancements yet evaluating these models
remains a challenge. A comprehensive evaluation benchmark for video generation is …

保存引用被引用数: 203 関連記事全 4 バージョン HTMLバージョン

[Free GPT-4]

[PDF] arxiv.org

Physdreamer: Physics-based interaction with 3d objects via video generation

T Zhang, HX Yu, R Wu, BY Feng, C Zheng… - … on Computer Vision, 2024 - Springer

Realistic object interactions are crucial for creating immersive virtual experiences, yet
synthesizing realistic 3D object dynamics in response to novel interactions remains a …

保存引用被引用数: 33 関連記事全 2 バージョン

[Free GPT-4]

[PDF] nature.com

Dynamical regimes of diffusion models

G Biroli, T Bonnaire, V De Bortoli, M Mézard - Nature Communications, 2024 - nature.com

We study generative diffusion models in the regime where both the data dimension and the
sample size are large, and the score function is trained optimally. Using statistical physics …

保存引用被引用数: 28 関連記事全 2 バージョン

[Free GPT-4]

[PDF] ieee.org

When does Sora show: The beginning of TAO to imaginative intelligence and scenarios engineering

FY Wang, Q Miao, L Li, Q Ni, X Li, J Li… - IEEE/CAA Journal of …, 2024 - ieeexplore.ieee.org

During our discussion at workshops for writing “What Does ChatGPT Say: The DAO from
Algorithmic Intelligence to Linguistic Intelligence”[1], we had expected the next milestone for …

保存引用被引用数: 57 関連記事全 4 バージョン

アラートを作成

引用

検索オプション

マイライブラリに保存しました

Lumiere: A space-time diffusion model for video generation

Sora: A review on background, technology, limitations, and opportunities of large vision models

State of the art on diffusion models for visual computing

Show-1: Marrying pixel and latent diffusion models for text-to-video generation

Videopoet: A large language model for zero-shot video generation

Fast high-resolution image synthesis with latent adversarial diffusion distillation

Sparsectrl: Adding sparse controls to text-to-video diffusion models

Vbench: Comprehensive benchmark suite for video generative models

Physdreamer: Physics-based interaction with 3d objects via video generation

Dynamical regimes of diffusion models

When does Sora show: The beginning of TAO to imaginative intelligence and scenarios engineering