- Academic Search

Gem Citer Citeret af 170 Relaterede artikler Alle 6 versioner Vis som HTML

Neural 3d scene reconstruction with the manhattan-world assumption

H Guo, S Peng, H Lin, Q Wang… - Proceedings of the …, 2022 - openaccess.thecvf.com

This paper addresses the challenge of reconstructing 3D indoor scenes from multi-view
images. Many previous works have shown impressive reconstruction results on textured …

Gem Citer Citeret af 205 Relaterede artikler Alle 7 versioner Vis som HTML

Fast point transformer

C Park, Y Jeong, M Cho, J Park - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

The recent success of neural networks enables a better interpretation of 3D point clouds, but
processing a large-scale 3D scene remains a challenging problem. Most current …

Gem Citer Citeret af 251 Relaterede artikler Alle 7 versioner

CMX: Cross-modal fusion for RGB-X semantic segmentation with transformers

J Zhang, H Liu, K Yang, X Hu, R Liu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Scene understanding based on image segmentation is a crucial component of autonomous
vehicles. Pixel-wise semantic segmentation of RGB images can be advanced by exploiting …

Gem Citer Citeret af 81 Relaterede artikler Alle 9 versioner Vis som HTML

Learning multi-view aggregation in the wild for large-scale 3d semantic segmentation

D Robert, B Vallet, L Landrieu - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

Recent works on 3D semantic segmentation propose to exploit the synergy between images
and point clouds by processing each modality with a dedicated network and projecting …

Gem Citer Citeret af 92 Relaterede artikler Alle 7 versioner Vis som HTML

X-trans2cap: Cross-modal knowledge transfer using transformer for 3d dense captioning

Z Yuan, X Yan, Y Liao, Y Guo, G Li… - Proceedings of the …, 2022 - openaccess.thecvf.com

Abstract 3D dense captioning aims to describe individual objects by natural language in 3D
scenes, where 3D scenes are usually represented as RGB-D scans or point clouds …

Gem Citer Citeret af 26 Relaterede artikler Alle 4 versioner Vis som HTML

Depthcrafter: Generating consistent long depth sequences for open-world videos

W Hu, X Gao, X Li, S Zhao, X Cun, Y Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org

Despite significant advancements in monocular depth estimation for static images,
estimating video depth in the open world remains challenging, since open-world videos are …

Gem Citer Citeret af 50 Relaterede artikler Alle 3 versioner Vis som HTML

Peal: Prior-embedded explicit attention learning for low-overlap point cloud registration

J Yu, L Ren, Y Zhang, W Zhou… - Proceedings of the …, 2023 - openaccess.thecvf.com

Learning distinctive point-wise features is critical for low-overlap point cloud registration.
Recently, it has achieved huge success in incorporating Transformer into point cloud feature …

Gem Citer Citeret af 65 Relaterede artikler Alle 9 versioner

Box2mask: Weakly supervised 3d semantic instance segmentation using bounding boxes

J Chibane, F Engelmann, T Anh Tran… - European conference on …, 2022 - Springer

Current 3D segmentation methods heavily rely on large-scale point-cloud datasets, which
are notoriously laborious to annotate. Few attempts have been made to circumvent the need …