Rethinking range view representation for lidar segmentation
LiDAR segmentation is crucial for autonomous driving perception. Recent trends favor point-
or voxel-based methods as they often yield better performance than the traditional range …
or voxel-based methods as they often yield better performance than the traditional range …
Openshape: Scaling up 3d shape representation towards open-world understanding
We introduce OpenShape, a method for learning multi-modal joint representations of text,
image, and point clouds. We adopt the commonly used multi-modal contrastive learning …
image, and point clouds. We adopt the commonly used multi-modal contrastive learning …
Towards open vocabulary learning: A survey
In the field of visual scene understanding, deep neural networks have made impressive
advancements in various core tasks like segmentation, tracking, and detection. However …
advancements in various core tasks like segmentation, tracking, and detection. However …
Unsupervised 3d perception with 2d vision-language distillation for autonomous driving
Closed-set 3D perception models trained on only a pre-defined set of object categories can
be inadequate for safety critical applications such as autonomous driving where new object …
be inadequate for safety critical applications such as autonomous driving where new object …
Multi-modal data-efficient 3d scene understanding for autonomous driving
Efficient data utilization is crucial for advancing 3D scene understanding in autonomous
driving, where reliance on heavily human-annotated LiDAR point clouds challenges fully …
driving, where reliance on heavily human-annotated LiDAR point clouds challenges fully …
Clip-fo3d: Learning free open-world 3d scene representations from 2d dense clip
Training a 3D scene understanding model requires complicated human annotations, which
are laborious to collect and result in a model only encoding close-set object semantics. In …
are laborious to collect and result in a model only encoding close-set object semantics. In …
See more and know more: Zero-shot point cloud segmentation via multi-modal visual data
Zero-shot point cloud segmentation aims to make deep models capable of recognizing
novel objects in point cloud that are unseen in the training phase. Recent trends favor the …
novel objects in point cloud that are unseen in the training phase. Recent trends favor the …
Weakly supervised 3d open-vocabulary segmentation
Open-vocabulary segmentation of 3D scenes is a fundamental function of human perception
and thus a crucial objective in computer vision research. However, this task is heavily …
and thus a crucial objective in computer vision research. However, this task is heavily …
A survey on deep learning based segmentation, detection and classification for 3D point clouds
The computer vision, graphics, and machine learning research groups have given a
significant amount of focus to 3D object recognition (segmentation, detection, and …
significant amount of focus to 3D object recognition (segmentation, detection, and …
Openins3d: Snap and lookup for 3d open-vocabulary instance segmentation
In this work, we introduce OpenIns3D, a new 3D-input-only framework for 3D open-
vocabulary scene understanding. The OpenIns3D framework employs a “Mask-Snap …
vocabulary scene understanding. The OpenIns3D framework employs a “Mask-Snap …