Nerfdiff: Single-image view synthesis with nerf-guided distillation from 3d-aware diffusion
Novel view synthesis from a single image requires inferring occluded regions of objects and
scenes whilst simultaneously maintaining semantic and physical consistency with the input …
scenes whilst simultaneously maintaining semantic and physical consistency with the input …
Gaussianavatar: Towards realistic human avatar modeling from a single video via animatable 3d gaussians
We present GaussianAvatar an efficient approach to creating realistic human avatars with
dynamic 3D appearances from a single video. We start by introducing animatable 3D …
dynamic 3D appearances from a single video. We start by introducing animatable 3D …
Segment anything in 3d with nerfs
Abstract Recently, the Segment Anything Model (SAM) emerged as a powerful vision
foundation model which is capable to segment anything in 2D images. This paper aims to …
foundation model which is capable to segment anything in 2D images. This paper aims to …
Neural fields in visual computing and beyond
Recent advances in machine learning have led to increased interest in solving visual
computing problems using methods that employ coordinate‐based neural networks. These …
computing problems using methods that employ coordinate‐based neural networks. These …
Feature 3dgs: Supercharging 3d gaussian splatting to enable distilled feature fields
Abstract 3D scene representations have gained immense popularity in recent years.
Methods that use Neural Radiance fields are versatile for traditional tasks such as novel …
Methods that use Neural Radiance fields are versatile for traditional tasks such as novel …
Conditional object-centric learning from video
Object-centric representations are a promising path toward more systematic generalization
by providing flexible abstractions upon which compositional world models can be built …
by providing flexible abstractions upon which compositional world models can be built …
Object scene representation transformer
A compositional understanding of the world in terms of objects and their geometry in 3D
space is considered a cornerstone of human cognition. Facilitating the learning of such a …
space is considered a cornerstone of human cognition. Facilitating the learning of such a …
Simple unsupervised object-centric learning for complex and naturalistic videos
Unsupervised object-centric learning aims to represent the modular, compositional, and
causal structure of a scene as a set of object representations and thereby promises to …
causal structure of a scene as a set of object representations and thereby promises to …
Weakly supervised 3d open-vocabulary segmentation
Open-vocabulary segmentation of 3D scenes is a fundamental function of human perception
and thus a crucial objective in computer vision research. However, this task is heavily …
and thus a crucial objective in computer vision research. However, this task is heavily …
Towards unsupervised object detection from lidar point clouds
In this paper, we study the problem of unsupervised object detection from 3D point clouds in
self-driving scenes. We present a simple yet effective method that exploits (i) point clustering …
self-driving scenes. We present a simple yet effective method that exploits (i) point clustering …