محقق Google

Z **ng, Q Feng, H Chen, Q Dai, H Hu, H Xu… - ACM Computing …, 2024‏ - dl.acm.org‏

The recent wave of AI-generated content (AIGC) has witnessed substantial success in
computer vision, with the diffusion model playing a crucial role in this achievement. Due to …‏

ذخیره ارجاع بیان شده در 97 یافته مقاله‌های مربوط تمام نسخه‌های 4

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Vbench: Comprehensive benchmark suite for video generative models‏

Z Huang, Y He, J Yu, F Zhang, C Si… - Proceedings of the …, 2024‏ - openaccess.thecvf.com‏

Video generation has witnessed significant advancements yet evaluating these models
remains a challenge. A comprehensive evaluation benchmark for video generation is …‏

ذخیره ارجاع بیان شده در 225 یافته مقاله‌های مربوط تمام نسخه‌های 7 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Videocrafter2: Overcoming data limitations for high-quality video diffusion models‏

H Chen, Y Zhang, X Cun, M **a… - Proceedings of the …, 2024‏ - openaccess.thecvf.com‏

Text-to-video generation aims to produce a video based on a given prompt. Recently
several commercial video models have been able to generate plausible videos with minimal …‏

ذخیره ارجاع بیان شده در 192 یافته مقاله‌های مربوط تمام نسخه‌های 6 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Sparsectrl: Adding sparse controls to text-to-video diffusion models‏

Y Guo, C Yang, A Rao, M Agrawala, D Lin… - European Conference on …, 2024‏ - Springer‏

The development of text-to-video (T2V), ie, generating videos with a given text prompt, has
been significantly advanced in recent years. However, relying solely on text prompts often …‏

ذخیره ارجاع بیان شده در 82 یافته مقاله‌های مربوط تمام نسخه‌های 8

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

I2vgen-xl: High-quality image-to-video synthesis via cascaded diffusion models‏

S Zhang, J Wang, Y Zhang, K Zhao, H Yuan… - arxiv preprint arxiv …, 2023‏ - arxiv.org‏

Video synthesis has recently made remarkable strides benefiting from the rapid
development of diffusion models. However, it still encounters challenges in terms of …‏

ذخیره ارجاع بیان شده در 172 یافته مقاله‌های مربوط تمام نسخه‌های 2 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Miradata: A large-scale video dataset with long durations and structured captions‏

X Ju, Y Gao, Z Zhang, Z Yuan… - Advances in …, 2025‏ - proceedings.neurips.cc‏

Sora's high-motion intensity and long consistent videos have significantly impacted the field
of video generation, attracting unprecedented attention. However, existing publicly available …‏

ذخیره ارجاع بیان شده در 30 یافته مقاله‌های مربوط تمام نسخه‌های 4 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

From sora what we can see: A survey of text-to-video generation‏

R Sun, Y Zhang, T Shah, J Sun, S Zhang, W Li… - arxiv preprint arxiv …, 2024‏ - arxiv.org‏

With impressive achievements made, artificial intelligence is on the path forward to artificial
general intelligence. Sora, developed by OpenAI, which is capable of minute-level world …‏

ذخیره ارجاع بیان شده در 13 یافته مقاله‌های مربوط تمام نسخه‌های 4 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Motion-i2v: Consistent and controllable image-to-video generation with explicit motion modeling‏

X Shi, Z Huang, FY Wang, W Bian, D Li… - ACM SIGGRAPH 2024 …, 2024‏ - dl.acm.org‏

We introduce Motion-I2V, a novel framework for consistent and controllable text-guided
image-to-video generation (I2V). In contrast to previous methods that directly learn the …‏

ذخیره ارجاع بیان شده در 61 یافته مقاله‌های مربوط تمام نسخه‌های 5

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Streamingt2v: Consistent, dynamic, and extendable long video generation from text‏

R Henschel, L Khachatryan, D Hayrapetyan… - arxiv preprint arxiv …, 2024‏ - arxiv.org‏

Text-to-video diffusion models enable the generation of high-quality videos that follow text
instructions, making it easy to create diverse and individual content. However, existing …‏

ذخیره ارجاع بیان شده در 58 یافته مقاله‌های مربوط تمام نسخه‌های 4 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Tooncrafter: Generative cartoon interpolation‏

J **ng, H Liu, M **a, Y Zhang, X Wang, Y Shan… - ACM Transactions on …, 2024‏ - dl.acm.org‏

We introduce ToonCrafter, a novel approach that transcends traditional correspondence-
based cartoon video interpolation, paving the way for generative interpolation. Traditional …‏

ذخیره ارجاع بیان شده در 24 یافته مقاله‌های مربوط تمام نسخه‌های 5

ایجاد هشدار

ارجاع

جستجوی پیشرفته

در «کتابخانه من» ذخیره شد

Dynamicrafter: Animating open-domain images with video diffusion priors

A survey on video diffusion models‏

Vbench: Comprehensive benchmark suite for video generative models‏

Videocrafter2: Overcoming data limitations for high-quality video diffusion models‏

Sparsectrl: Adding sparse controls to text-to-video diffusion models‏

I2vgen-xl: High-quality image-to-video synthesis via cascaded diffusion models‏

Miradata: A large-scale video dataset with long durations and structured captions‏

From sora what we can see: A survey of text-to-video generation‏

Motion-i2v: Consistent and controllable image-to-video generation with explicit motion modeling‏

Streamingt2v: Consistent, dynamic, and extendable long video generation from text‏

Tooncrafter: Generative cartoon interpolation‏