A review of generalized zero-shot learning methods
Generalized zero-shot learning (GZSL) aims to train a model for classifying data samples
under the condition that some output classes are unknown during supervised learning. To …
under the condition that some output classes are unknown during supervised learning. To …
Openscene: 3d scene understanding with open vocabularies
Traditional 3D scene understanding approaches rely on labeled 3D datasets to train a
model for a single task with supervision. We propose OpenScene, an alternative approach …
model for a single task with supervision. We propose OpenScene, an alternative approach …
Pla: Language-driven open-vocabulary 3d scene understanding
Open-vocabulary scene understanding aims to localize and recognize unseen categories
beyond the annotated label space. The recent breakthrough of 2D open-vocabulary …
beyond the annotated label space. The recent breakthrough of 2D open-vocabulary …
CLIP2: Contrastive language-image-point pretraining from real-world point cloud data
Abstract Contrastive Language-Image Pre-training, benefiting from large-scale unlabeled
text-image pairs, has demonstrated great performance in open-world vision understanding …
text-image pairs, has demonstrated great performance in open-world vision understanding …
Semantic-aware knowledge distillation for few-shot class-incremental learning
Few-shot class incremental learning (FSCIL) portrays the problem of learning new concepts
gradually, where only a few examples per concept are available to the learner. Due to the …
gradually, where only a few examples per concept are available to the learner. Due to the …
Clip-fo3d: Learning free open-world 3d scene representations from 2d dense clip
Training a 3D scene understanding model requires complicated human annotations, which
are laborious to collect and result in a model only encoding close-set object semantics. In …
are laborious to collect and result in a model only encoding close-set object semantics. In …
Regionplc: Regional point-language contrastive learning for open-world 3d scene understanding
We propose a lightweight and scalable Regional Point-Language Contrastive learning
framework namely RegionPLC for open-world 3D scene understanding aiming to identify …
framework namely RegionPLC for open-world 3D scene understanding aiming to identify …
See more and know more: Zero-shot point cloud segmentation via multi-modal visual data
Zero-shot point cloud segmentation aims to make deep models capable of recognizing
novel objects in point cloud that are unseen in the training phase. Recent trends favor the …
novel objects in point cloud that are unseen in the training phase. Recent trends favor the …
Synthesized feature based few-shot class-incremental learning on a mixture of subspaces
Few-shot class incremental learning (FSCIL) aims to incrementally add sets of novel classes
to a well-trained base model in multiple training sessions with the restriction that only a few …
to a well-trained base model in multiple training sessions with the restriction that only a few …
Lowis3d: Language-driven open-world instance-level 3d scene understanding
Open-world instance-level scene understanding aims to locate and recognize unseen object
categories that are not present in the annotated dataset. This task is challenging because …
categories that are not present in the annotated dataset. This task is challenging because …