State of the art on diffusion models for visual computing

R Po, W Yifan, V Golyanik, K Aberman… - Computer Graphics …, 2024 - Wiley Online Library
The field of visual computing is rapidly advancing due to the emergence of generative
artificial intelligence (AI), which unlocks unprecedented capabilities for the generation …

Adversarial diffusion distillation

A Sauer, D Lorenz, A Blattmann… - European Conference on …, 2024 - Springer
Abstract We introduce Adversarial Diffusion Distillation (ADD), a novel training approach that
efficiently samples large-scale foundational image diffusion models in just 1–4 steps while …

Grm: Large gaussian reconstruction model for efficient 3d reconstruction and generation

Y Xu, Z Shi, W Yifan, H Chen, C Yang, S Peng… - … on Computer Vision, 2024 - Springer
We introduce GRM, a large-scale reconstructor capable of recovering a 3D asset from
sparse-view images in around 0.1 s. GRM is a feed-forward transformer-based model that …

Gaussianeditor: Swift and controllable 3d editing with gaussian splatting

Y Chen, Z Chen, C Zhang, F Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract 3D editing plays a crucial role in many areas such as gaming and virtual reality.
Traditional 3D editing methods which rely on representations like meshes and point clouds …

One-step diffusion with distribution matching distillation

T Yin, M Gharbi, R Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Diffusion models generate high-quality images but require dozens of forward passes. We
introduce Distribution Matching Distillation (DMD) a procedure to transform a diffusion model …

Dynamic prompt learning: Addressing cross-attention leakage for text-based image editing

F Yang, S Yang, MA Butt… - Advances in Neural …, 2023 - proceedings.neurips.cc
Large-scale text-to-image generative models have been a ground-breaking development in
generative AI, with diffusion models showing their astounding ability to synthesize …

Diff-instruct: A universal approach for transferring knowledge from pre-trained diffusion models

W Luo, T Hu, S Zhang, J Sun, Z Li… - Advances in Neural …, 2024 - proceedings.neurips.cc
Due to the ease of training, ability to scale, and high sample quality, diffusion models (DMs)
have become the preferred option for generative modeling, with numerous pre-trained …

Headstudio: Text to animatable head avatars with 3d gaussian splatting

Z Zhou, F Ma, H Fan, Z Yang, Y Yang - European Conference on Computer …, 2024 - Springer
Creating digital avatars from textual prompts has long been a desirable yet challenging task.
Despite the promising results achieved with 2D diffusion priors, current methods struggle to …

Diffusion model-based image editing: A survey

Y Huang, J Huang, Y Liu, M Yan, J Lv, J Liu… - arxiv preprint arxiv …, 2024 - arxiv.org
Denoising diffusion models have emerged as a powerful tool for various image generation
and editing tasks, facilitating the synthesis of visual content in an unconditional or input …

Headsculpt: Crafting 3d head avatars with text

X Han, Y Cao, K Han, X Zhu, J Deng… - Advances in …, 2024 - proceedings.neurips.cc
Recently, text-guided 3D generative methods have made remarkable advancements in
producing high-quality textures and geometry, capitalizing on the proliferation of large vision …