Openscene: 3d scene understanding with open vocabularies
Traditional 3D scene understanding approaches rely on labeled 3D datasets to train a
model for a single task with supervision. We propose OpenScene, an alternative approach …
model for a single task with supervision. We propose OpenScene, an alternative approach …
Neural 3d scene reconstruction with the manhattan-world assumption
This paper addresses the challenge of reconstructing 3D indoor scenes from multi-view
images. Many previous works have shown impressive reconstruction results on textured …
images. Many previous works have shown impressive reconstruction results on textured …
Fast point transformer
The recent success of neural networks enables a better interpretation of 3D point clouds, but
processing a large-scale 3D scene remains a challenging problem. Most current …
processing a large-scale 3D scene remains a challenging problem. Most current …
CMX: Cross-modal fusion for RGB-X semantic segmentation with transformers
Scene understanding based on image segmentation is a crucial component of autonomous
vehicles. Pixel-wise semantic segmentation of RGB images can be advanced by exploiting …
vehicles. Pixel-wise semantic segmentation of RGB images can be advanced by exploiting …
Learning multi-view aggregation in the wild for large-scale 3d semantic segmentation
Recent works on 3D semantic segmentation propose to exploit the synergy between images
and point clouds by processing each modality with a dedicated network and projecting …
and point clouds by processing each modality with a dedicated network and projecting …
X-trans2cap: Cross-modal knowledge transfer using transformer for 3d dense captioning
Abstract 3D dense captioning aims to describe individual objects by natural language in 3D
scenes, where 3D scenes are usually represented as RGB-D scans or point clouds …
scenes, where 3D scenes are usually represented as RGB-D scans or point clouds …
Depthcrafter: Generating consistent long depth sequences for open-world videos
Despite significant advancements in monocular depth estimation for static images,
estimating video depth in the open world remains challenging, since open-world videos are …
estimating video depth in the open world remains challenging, since open-world videos are …
Peal: Prior-embedded explicit attention learning for low-overlap point cloud registration
J Yu, L Ren, Y Zhang, W Zhou… - Proceedings of the …, 2023 - openaccess.thecvf.com
Learning distinctive point-wise features is critical for low-overlap point cloud registration.
Recently, it has achieved huge success in incorporating Transformer into point cloud feature …
Recently, it has achieved huge success in incorporating Transformer into point cloud feature …
Box2mask: Weakly supervised 3d semantic instance segmentation using bounding boxes
Current 3D segmentation methods heavily rely on large-scale point-cloud datasets, which
are notoriously laborious to annotate. Few attempts have been made to circumvent the need …
are notoriously laborious to annotate. Few attempts have been made to circumvent the need …
Deep learning based 3D segmentation: A survey
3D segmentation is a fundamental and challenging problem in computer vision with
applications in autonomous driving and robotics. It has received significant attention from the …
applications in autonomous driving and robotics. It has received significant attention from the …