State of the art on diffusion models for visual computing

R Po, W Yifan, V Golyanik, K Aberman… - Computer Graphics …, 2024 - Wiley Online Library
The field of visual computing is rapidly advancing due to the emergence of generative
artificial intelligence (AI), which unlocks unprecedented capabilities for the generation …

Recent advances in implicit representation-based 3d shape generation

JM Sun, T Wu, L Gao - Visual Intelligence, 2024 - Springer
Various techniques have been developed and introduced to address the pressing need to
create three-dimensional (3D) content for advanced applications such as virtual reality and …

Adversarial diffusion distillation

A Sauer, D Lorenz, A Blattmann… - European Conference on …, 2024 - Springer
Abstract We introduce Adversarial Diffusion Distillation (ADD), a novel training approach that
efficiently samples large-scale foundational image diffusion models in just 1–4 steps while …

Hexplane: A fast representation for dynamic scenes

A Cao, J Johnson - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Modeling and re-rendering dynamic 3D scenes is a challenging task in 3D vision. Prior
approaches build on NeRF and rely on implicit representations. This is slow since it requires …

Text2room: Extracting textured 3d meshes from 2d text-to-image models

L Höllein, A Cao, A Owens… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract We present Text2Room, a method for generating room-scale textured 3D meshes
from a given text prompt as input. To this end, we leverage pre-trained 2D text-to-image …

Align your gaussians: Text-to-4d with dynamic 3d gaussians and composed diffusion models

H Ling, SW Kim, A Torralba… - Proceedings of the …, 2024 - openaccess.thecvf.com
Text-guided diffusion models have revolutionized image and video generation and have
also been successfully used for optimization-based 3D object synthesis. Here we instead …

Tri-miprf: Tri-mip representation for efficient anti-aliasing neural radiance fields

W Hu, Y Wang, L Ma, B Yang, L Gao… - Proceedings of the …, 2023 - openaccess.thecvf.com
Despite the tremendous progress in neural radiance fields (NeRF), we still face a dilemma of
the trade-off between quality and efficiency, eg, MipNeRF presents fine-detailed and anti …

Dreamllm: Synergistic multimodal comprehension and creation

R Dong, C Han, Y Peng, Z Qi, Z Ge, J Yang… - arxiv preprint arxiv …, 2023 - arxiv.org
This paper presents DreamLLM, a learning framework that first achieves versatile
Multimodal Large Language Models (MLLMs) empowered with frequently overlooked …

4d-fy: Text-to-4d generation using hybrid score distillation sampling

S Bahmani, I Skorokhodov, V Rong… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recent breakthroughs in text-to-4D generation rely on pre-trained text-to-image and text-to-
video models to generate dynamic 3D scenes. However current text-to-4D methods face a …

Hyperdiffusion: Generating implicit neural fields with weight-space diffusion

Z Erkoç, F Ma, Q Shan, M Nießner… - Proceedings of the …, 2023 - openaccess.thecvf.com
Implicit neural fields, typically encoded by a multilayer perceptron (MLP) that maps from
coordinates (eg, xyz) to signals (eg, signed distances), have shown remarkable promise as …