A survey on label-efficient deep image segmentation: Bridging the gap between weak supervision and dense prediction
The rapid development of deep learning has made a great progress in image segmentation,
one of the fundamental tasks of computer vision. However, the current segmentation …
one of the fundamental tasks of computer vision. However, the current segmentation …
Visual semantic segmentation based on few/zero-shot learning: An overview
Visual semantic segmentation aims at separating a visual sample into diverse blocks with
specific semantic attributes and identifying the category for each block, and it plays a crucial …
specific semantic attributes and identifying the category for each block, and it plays a crucial …
Zegclip: Towards adapting clip for zero-shot semantic segmentation
Recently, CLIP has been applied to pixel-level zero-shot learning tasks via a wo-stage
scheme. The general idea is to first generate class-agnostic region proposals and then feed …
scheme. The general idea is to first generate class-agnostic region proposals and then feed …
Diffumask: Synthesizing images with pixel-level annotations for semantic segmentation using diffusion models
Collecting and annotating images with pixel-wise labels is time-consuming and laborious. In
contrast, synthetic data can be freely available using a generative model (eg, DALL-E …
contrast, synthetic data can be freely available using a generative model (eg, DALL-E …
Feature 3dgs: Supercharging 3d gaussian splatting to enable distilled feature fields
Abstract 3D scene representations have gained immense popularity in recent years.
Methods that use Neural Radiance fields are versatile for traditional tasks such as novel …
Methods that use Neural Radiance fields are versatile for traditional tasks such as novel …
Towards open vocabulary learning: A survey
In the field of visual scene understanding, deep neural networks have made impressive
advancements in various core tasks like segmentation, tracking, and detection. However …
advancements in various core tasks like segmentation, tracking, and detection. However …
Datasetdm: Synthesizing data with perception annotations using diffusion models
Current deep networks are very data-hungry and benefit from training on large-scale
datasets, which are often time-consuming to collect and annotate. By contrast, synthetic data …
datasets, which are often time-consuming to collect and annotate. By contrast, synthetic data …
Image segmentation using text and image prompts
Image segmentation is usually addressed by training a model for a fixed set of object
classes. Incorporating additional classes or more complex queries later is expensive as it …
classes. Incorporating additional classes or more complex queries later is expensive as it …
Extract free dense labels from clip
Abstract Contrastive Language-Image Pre-training (CLIP) has made a remarkable
breakthrough in open-vocabulary zero-shot image recognition. Many recent studies …
breakthrough in open-vocabulary zero-shot image recognition. Many recent studies …
Pla: Language-driven open-vocabulary 3d scene understanding
Open-vocabulary scene understanding aims to localize and recognize unseen categories
beyond the annotated label space. The recent breakthrough of 2D open-vocabulary …
beyond the annotated label space. The recent breakthrough of 2D open-vocabulary …