Openscene: 3d scene understanding with open vocabularies

S Peng, K Genova, C Jiang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Traditional 3D scene understanding approaches rely on labeled 3D datasets to train a
model for a single task with supervision. We propose OpenScene, an alternative approach …

Neural 3d scene reconstruction with the manhattan-world assumption

H Guo, S Peng, H Lin, Q Wang… - Proceedings of the …, 2022 - openaccess.thecvf.com
This paper addresses the challenge of reconstructing 3D indoor scenes from multi-view
images. Many previous works have shown impressive reconstruction results on textured …

Fast point transformer

C Park, Y Jeong, M Cho, J Park - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
The recent success of neural networks enables a better interpretation of 3D point clouds, but
processing a large-scale 3D scene remains a challenging problem. Most current …

CMX: Cross-modal fusion for RGB-X semantic segmentation with transformers

J Zhang, H Liu, K Yang, X Hu, R Liu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Scene understanding based on image segmentation is a crucial component of autonomous
vehicles. Pixel-wise semantic segmentation of RGB images can be advanced by exploiting …

Learning multi-view aggregation in the wild for large-scale 3d semantic segmentation

D Robert, B Vallet, L Landrieu - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Recent works on 3D semantic segmentation propose to exploit the synergy between images
and point clouds by processing each modality with a dedicated network and projecting …

X-trans2cap: Cross-modal knowledge transfer using transformer for 3d dense captioning

Z Yuan, X Yan, Y Liao, Y Guo, G Li… - Proceedings of the …, 2022 - openaccess.thecvf.com
Abstract 3D dense captioning aims to describe individual objects by natural language in 3D
scenes, where 3D scenes are usually represented as RGB-D scans or point clouds …

Depthcrafter: Generating consistent long depth sequences for open-world videos

W Hu, X Gao, X Li, S Zhao, X Cun, Y Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org
Despite significant advancements in monocular depth estimation for static images,
estimating video depth in the open world remains challenging, since open-world videos are …

Peal: Prior-embedded explicit attention learning for low-overlap point cloud registration

J Yu, L Ren, Y Zhang, W Zhou… - Proceedings of the …, 2023 - openaccess.thecvf.com
Learning distinctive point-wise features is critical for low-overlap point cloud registration.
Recently, it has achieved huge success in incorporating Transformer into point cloud feature …

Box2mask: Weakly supervised 3d semantic instance segmentation using bounding boxes

J Chibane, F Engelmann, T Anh Tran… - European conference on …, 2022 - Springer
Current 3D segmentation methods heavily rely on large-scale point-cloud datasets, which
are notoriously laborious to annotate. Few attempts have been made to circumvent the need …

Deep learning based 3D segmentation: A survey

Y He, H Yu, X Liu, Z Yang, W Sun, S Anwar… - arxiv preprint arxiv …, 2021 - arxiv.org
3D segmentation is a fundamental and challenging problem in computer vision with
applications in autonomous driving and robotics. It has received significant attention from the …