A survey on video diffusion models

Z Xing, Q Feng, H Chen, Q Dai, H Hu, H Xu… - ACM Computing …, 2024 - dl.acm.org
The recent wave of AI-generated content (AIGC) has witnessed substantial success in
computer vision, with the diffusion model playing a crucial role in this achievement. Due to …

Image synthesis with adversarial networks: A comprehensive survey and case studies

P Shamsolmoali, M Zareapoor, E Granger, H Zhou… - Information …, 2021 - Elsevier
Abstract Generative Adversarial Networks (GANs) have been extremely successful in
various application domains such as computer vision, medicine, and natural language …

DynamiCrafter: Animating open-domain images with video diffusion priors

J Xing, M Xia, Y Zhang, H Chen, W Yu, H Liu… - … on Computer Vision, 2024 - Springer
Animating a still image offers an engaging visual experience. Traditional image animation
techniques mainly focus on animating natural scenes with stochastic dynamics (e.g., clouds …

Latte: Latent diffusion transformer for video generation

X Ma, Y Wang, G Jia, X Chen, Z Liu, YF Li… - arXiv preprint arXiv …, 2024 - arxiv.org
We propose a novel Latent Diffusion Transformer, namely Latte, for video generation. Latte
first extracts spatio-temporal tokens from input videos and then adopts a series of …
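
The snippet cuts off just as it describes Latte's core step: splitting a clip into spatio-temporal tokens that a stack of Transformer blocks then processes. Below is a minimal sketch of that tokenization, assuming a (T, H, W, C) video array; the patch sizes and resulting token dimension are illustrative choices, not Latte's actual configuration (which operates on VAE latents rather than raw pixels).

```python
import numpy as np

def patchify_video(video, patch_t=2, patch_h=16, patch_w=16):
    """Split a video of shape (T, H, W, C) into flattened spatio-temporal
    patches ("tokens"), each covering patch_t frames and a patch_h x patch_w
    spatial window. Returns an array of shape (num_tokens, token_dim).

    Patch sizes here are illustrative, not Latte's actual settings.
    """
    T, H, W, C = video.shape
    assert T % patch_t == 0 and H % patch_h == 0 and W % patch_w == 0
    nt, nh, nw = T // patch_t, H // patch_h, W // patch_w
    # Reshape into a grid of (nt, nh, nw) blocks, then flatten each block.
    tokens = (
        video.reshape(nt, patch_t, nh, patch_h, nw, patch_w, C)
             .transpose(0, 2, 4, 1, 3, 5, 6)   # (nt, nh, nw, pt, ph, pw, C)
             .reshape(nt * nh * nw, patch_t * patch_h * patch_w * C)
    )
    return tokens

# Example: a 16-frame 64x64 RGB clip becomes 8*4*4 = 128 tokens of dim 1536.
clip = np.random.rand(16, 64, 64, 3).astype(np.float32)
print(patchify_video(clip).shape)  # (128, 1536)
```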

Latent video diffusion models for high-fidelity long video generation

Y He, T Yang, Y Zhang, Y Shan, Q Chen - arXiv preprint arXiv:2211.13221, 2022 - arxiv.org
AI-generated content has attracted significant attention recently, but photo-realistic video
synthesis remains challenging. Although many attempts using GANs and autoregressive …

Video probabilistic diffusion models in projected latent space

S Yu, K Sohn, S Kim, J Shin - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Despite the remarkable progress in deep generative models, synthesizing high-resolution
and temporally coherent videos remains a challenge due to their high dimensionality …

VideoFusion: Decomposed diffusion models for high-quality video generation

Z Luo, D Chen, Y Zhang, Y Huang, L Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
A diffusion probabilistic model (DPM), which constructs a forward diffusion process by
gradually adding noise to data points and learns the reverse denoising process to generate …
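
For context, the forward (noising) and reverse (denoising) processes this entry refers to can be written in the standard DDPM formulation shown below; VideoFusion's contribution, decomposing the per-frame noise, is not reflected in these equations.

```latex
% Standard DPM forward and reverse processes (DDPM-style formulation);
% VideoFusion's per-frame noise decomposition is not shown here.
q(x_t \mid x_{t-1}) = \mathcal{N}\!\bigl(x_t;\ \sqrt{1-\beta_t}\,x_{t-1},\ \beta_t \mathbf{I}\bigr),
\qquad
q(x_t \mid x_0) = \mathcal{N}\!\bigl(x_t;\ \sqrt{\bar{\alpha}_t}\,x_0,\ (1-\bar{\alpha}_t)\mathbf{I}\bigr),
\quad \bar{\alpha}_t = \prod_{s=1}^{t}(1-\beta_s),
\\[4pt]
p_\theta(x_{t-1} \mid x_t) = \mathcal{N}\!\bigl(x_{t-1};\ \mu_\theta(x_t, t),\ \sigma_t^2 \mathbf{I}\bigr).
```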

Conditional image-to-video generation with latent flow diffusion models

H Ni, C Shi, K Li, SX Huang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Conditional image-to-video (cI2V) generation aims to synthesize a new plausible video
starting from an image (e.g., a person's face) and a condition (e.g., an action class label like …

SimDA: Simple diffusion adapter for efficient video generation

Z Xing, Q Dai, H Hu, Z Wu… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
The recent wave of AI-generated content has witnessed the great development and success
of Text-to-Image (T2I) technologies. By contrast, Text-to-Video (T2V) still falls short of …

StyleGAN-V: A continuous video generator with the price, image quality and perks of StyleGAN2

I Skorokhodov, S Tulyakov… - Proceedings of the …, 2022 - openaccess.thecvf.com
Videos show continuous events, yet most, if not all, video synthesis frameworks treat them
discretely in time. In this work, we think of videos as what they should be: time-continuous …