Google Académico

O Bar-Tal, H Chefer, O Tov, C Herrmann… - SIGGRAPH Asia 2024 …, 2024 - dl.acm.org

We introduce Lumiere–a text-to-video diffusion model designed for synthesizing videos that
portray realistic, diverse and coherent motion–a pivotal challenge in video synthesis. To this …

Guardar Citar Citado por 182 Artículos relacionados Las 2 versiones

[Free GPT-4]

[PDF] arxiv.org

Grm: Large gaussian reconstruction model for efficient 3d reconstruction and generation

Y Xu, Z Shi, W Yifan, H Chen, C Yang, S Peng… - … on Computer Vision, 2024 - Springer

We introduce GRM, a large-scale reconstructor capable of recovering a 3D asset from
sparse-view images in around 0.1 s. GRM is a feed-forward transformer-based model that …

Guardar Citar Citado por 94 Artículos relacionados Las 2 versiones

[Free GPT-4]

[PDF] oup.com

Opportunities and challenges of diffusion models for generative AI

M Chen, S Mei, J Fan, M Wang - National Science Review, 2024 - academic.oup.com

Diffusion models, a powerful and universal generative artificial intelligence technology, have
achieved tremendous success and opened up new possibilities in diverse applications. In …

Guardar Citar Citado por 6 Artículos relacionados Las 7 versiones

[Free GPT-4]

[PDF] thecvf.com

Gpt-4v (ision) is a human-aligned evaluator for text-to-3d generation

T Wu, G Yang, Z Li, K Zhang, Z Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com

Despite recent advances in text-to-3D generative methods there is a notable absence of
reliable evaluation metrics. Existing metrics usually focus on a single criterion each such as …

Guardar Citar Citado por 68 Artículos relacionados Las 3 versiones Versión en HTML

[Free GPT-4]

[PDF] acm.org

CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets

L Zhang, Z Wang, Q Zhang, Q Qiu, A Pang… - ACM Transactions on …, 2024 - dl.acm.org

In the realm of digital creativity, our potential to craft intricate 3D worlds from imagination is
often hampered by the limitations of existing digital tools, which demand extensive expertise …

Guardar Citar Citado por 49 Artículos relacionados

[Free GPT-4]

[PDF] thecvf.com

4d-fy: Text-to-4d generation using hybrid score distillation sampling

S Bahmani, I Skorokhodov, V Rong… - Proceedings of the …, 2024 - openaccess.thecvf.com

Recent breakthroughs in text-to-4D generation rely on pre-trained text-to-image and text-to-
video models to generate dynamic 3D scenes. However current text-to-4D methods face a …

Guardar Citar Citado por 75 Artículos relacionados Las 7 versiones Versión en HTML

[Free GPT-4]

[PDF] thecvf.com

Reconfusion: 3d reconstruction with diffusion priors

R Wu, B Mildenhall, P Henzler, K Park… - Proceedings of the …, 2024 - openaccess.thecvf.com

Abstract 3D reconstruction methods such as Neural Radiance Fields (NeRFs) excel at
rendering photorealistic novel views of complex scenes. However recovering a high-quality …

Guardar Citar Citado por 108 Artículos relacionados Las 4 versiones Versión en HTML

[Free GPT-4]

[PDF] arxiv.org

Dmv3d: Denoising multi-view diffusion using 3d large reconstruction model

Y Xu, H Tan, F Luan, S Bi, P Wang, J Li, Z Shi… - arxiv preprint arxiv …, 2023 - arxiv.org

We propose\textbf {DMV3D}, a novel 3D generation approach that uses a transformer-based
3D large reconstruction model to denoise multi-view diffusion. Our reconstruction model …

Guardar Citar Citado por 126 Artículos relacionados Las 3 versiones Versión en HTML

[Free GPT-4]

[PDF] arxiv.org

Emdm: Efficient motion diffusion model for fast and high-quality motion generation

W Zhou, Z Dou, Z Cao, Z Liao, J Wang, W Wang… - … on Computer Vision, 2024 - Springer

Abstract We introduce Efficient Motion Diffusion Model (EMDM) for fast and high-quality
human motion generation. Current state-of-the-art generative diffusion models have …

Guardar Citar Citado por 36 Artículos relacionados Las 2 versiones

[Free GPT-4]

[PDF] arxiv.org

Diffusion model-based video editing: A survey

W Sun, RC Tu, J Liao, D Tao - arxiv preprint arxiv:2407.07111, 2024 - arxiv.org

The rapid development of diffusion models (DMs) has significantly advanced image and
video applications, making" what you want is what you see" a reality. Among these, video …

Guardar Citar Citado por 10 Artículos relacionados Las 2 versiones Versión en HTML

Crear alerta

Citar

Búsqueda avanzada

Guardado en Mi biblioteca

State of the art on diffusion models for visual computing

Lumiere: A space-time diffusion model for video generation

Grm: Large gaussian reconstruction model for efficient 3d reconstruction and generation

Opportunities and challenges of diffusion models for generative AI

Gpt-4v (ision) is a human-aligned evaluator for text-to-3d generation

CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets

4d-fy: Text-to-4d generation using hybrid score distillation sampling

Reconfusion: 3d reconstruction with diffusion priors

Dmv3d: Denoising multi-view diffusion using 3d large reconstruction model

Emdm: Efficient motion diffusion model for fast and high-quality motion generation

Diffusion model-based video editing: A survey