Pixel-aligned recurrent queries for multi-view 3d object detection
We present PARQ, a multi-view 3D object detector with transformer and pixel-aligned
recurrent queries. Unlike previous works that use learnable features or only encode 3D point …
Learning-based multi-view stereo: a survey
3D reconstruction aims to recover the dense 3D structure of a scene. It plays an essential
role in various applications such as Augmented/Virtual Reality (AR/VR), autonomous driving …
A local tangent plane distance-based approach to 3D point cloud segmentation via clustering
This paper proposes an effective measure for the planar segmentation problem based on
the clustering method. It uses the distance from a point to the local plane as a metric to …
Self-supervised super-plane for neural 3d reconstruction
Neural implicit surface representation methods show impressive reconstruction results but
struggle to handle texture-less planar regions that widely exist in indoor scenes. Existing …
Planeseg: Building a plug-in for boosting planar region segmentation
Existing methods in planar region segmentation suffer from the problems of vague boundaries
and failure to detect small-sized regions. To address these, this study presents an end-to …
Parf: Primitive-aware radiance fusion for indoor scene novel view synthesis
This paper proposes a method for fast scene radiance field reconstruction with strong novel
view synthesis performance and convenient scene editing functionality. The key idea is to …
Visfusion: Visibility-aware online 3d scene reconstruction from videos
We propose VisFusion, a visibility-aware online 3D scene reconstruction approach from
posed monocular videos. In particular, we aim to reconstruct the scene from volumetric …
Structural multiplane image: Bridging neural view synthesis and 3d reconstruction
The Multiplane Image (MPI), containing a set of fronto-parallel RGBA layers, is an
effective and efficient representation for view synthesis from sparse inputs. Yet, its fixed …
Frozenrecon: Pose-free 3d scene reconstruction with frozen depth models
3D scene reconstruction is a long-standing vision task. Existing approaches can be
categorized into geometry-based and learning-based methods. The former leverages multi …
AirPlanes: Accurate Plane Estimation via 3D-Consistent Embeddings
Extracting planes from a 3D scene is useful for downstream tasks in robotics and augmented
reality. In this paper we tackle the problem of estimating the planar surfaces in a scene from …