State of the art on diffusion models for visual computing

R Po, W Yifan, V Golyanik, K Aberman… - Computer Graphics …, 2024 - Wiley Online Library
The field of visual computing is rapidly advancing due to the emergence of generative
artificial intelligence (AI), which unlocks unprecedented capabilities for the generation …

State of the art on neural rendering

A Tewari, O Fried, J Thies, V Sitzmann… - Computer Graphics …, 2020 - Wiley Online Library
Efficient rendering of photo‐realistic virtual worlds is a long standing effort of computer
graphics. Modern graphics techniques have succeeded in synthesizing photo‐realistic …

Dust3r: Geometric 3d vision made easy

S Wang, V Leroy, Y Cabon… - Proceedings of the …, 2024 - openaccess.thecvf.com
Multi-view stereo reconstruction (MVS) in the wild requires to first estimate the camera
intrinsic and extrinsic parameters. These are usually tedious and cumbersome to obtain yet …

pixelsplat: 3d gaussian splats from image pairs for scalable generalizable 3d reconstruction

D Charatan, SL Li, A Tagliasacchi… - Proceedings of the …, 2024 - openaccess.thecvf.com
We introduce pixelSplat a feed-forward model that learns to reconstruct 3D radiance fields
parameterized by 3D Gaussian primitives from pairs of images. Our model features real-time …

Generative novel view synthesis with 3d-aware diffusion models

ER Chan, K Nagano, MA Chan… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a diffusion-based model for 3D-aware generative novel view synthesis from as
few as a single input image. Our model samples from the distribution of possible renderings …

Reconfusion: 3d reconstruction with diffusion priors

R Wu, B Mildenhall, P Henzler, K Park… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract 3D reconstruction methods such as Neural Radiance Fields (NeRFs) excel at
rendering photorealistic novel views of complex scenes. However recovering a high-quality …

Lightgaussian: Unbounded 3d gaussian compression with 15x reduction and 200+ fps

Z Fan, K Wang, K Wen, Z Zhu, D Xu… - Advances in neural …, 2025 - proceedings.neurips.cc
Recent advances in real-time neural rendering using point-based techniques have enabled
broader adoption of 3D representations. However, foundational approaches like 3D …

Merf: Memory-efficient radiance fields for real-time view synthesis in unbounded scenes

C Reiser, R Szeliski, D Verbin, P Srinivasan… - ACM Transactions on …, 2023 - dl.acm.org
Neural radiance fields enable state-of-the-art photorealistic view synthesis. However,
existing radiance field representations are either too compute-intensive for real-time …

Mvsplat: Efficient 3d gaussian splatting from sparse multi-view images

Y Chen, H Xu, C Zheng, B Zhuang, M Pollefeys… - … on Computer Vision, 2024 - Springer
We introduce MVSplat, an efficient model that, given sparse multi-view images as input,
predicts clean feed-forward 3D Gaussians. To accurately localize the Gaussian centers, we …

Tapir: Tracking any point with per-frame initialization and temporal refinement

C Doersch, Y Yang, M Vecerik… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a novel model for Tracking Any Point (TAP) that effectively tracks any queried
point on any physical surface throughout a video sequence. Our approach employs two …