What does clip know about a red circle? visual prompt engineering for vlms
Abstract Large-scale Vision-Language Models, such as CLIP, learn powerful image-text
representations that have found numerous applications, from zero-shot classification to text …
representations that have found numerous applications, from zero-shot classification to text …
No representation rules them all in category discovery
In this paper we tackle the problem of Generalized Category Discovery (GCD). Specifically,
given a dataset with labelled and unlabelled images, the task is to cluster all images in the …
given a dataset with labelled and unlabelled images, the task is to cluster all images in the …
A systematic survey of prompt engineering on vision-language foundation models
Prompt engineering is a technique that involves augmenting a large pre-trained model with
task-specific hints, known as prompts, to adapt the model to new tasks. Prompts can be …
task-specific hints, known as prompts, to adapt the model to new tasks. Prompts can be …
Open-world machine learning: A review and new outlooks
Machine learning has achieved remarkable success in many applications. However,
existing studies are largely based on the closed-world assumption, which assumes that the …
existing studies are largely based on the closed-world assumption, which assumes that the …
Learning semi-supervised gaussian mixture models for generalized category discovery
In this paper, we address the problem of generalized category discovery (GCD), ie, given a
set of images where part of them are labelled and the rest are not, the task is to automatically …
set of images where part of them are labelled and the rest are not, the task is to automatically …
Active generalized category discovery
Abstract Generalized Category Discovery (GCD) is a pragmatic and challenging open-world
task which endeavors to cluster unlabeled samples from both novel and old classes …
task which endeavors to cluster unlabeled samples from both novel and old classes …
Textual knowledge matters: Cross-modality co-teaching for generalized visual class discovery
In this paper, we study the problem of Generalized Category Discovery (GCD), which aims to
cluster unlabeled data from both known and unknown categories using the knowledge of …
cluster unlabeled data from both known and unknown categories using the knowledge of …
Learn to categorize or categorize to learn? self-coding for generalized category discovery
In the quest for unveiling novel categories at test time, we confront the inherent limitations of
traditional supervised recognition models that are restricted by a predefined category set …
traditional supervised recognition models that are restricted by a predefined category set …
Labeled data selection for category discovery
Visual category discovery methods aim to find novel categories in unlabeled visual data. At
training time, a set of labeled and unlabeled images are provided, where the labels …
training time, a set of labeled and unlabeled images are provided, where the labels …
Prediction consistency regularization for generalized category discovery
Abstract Generalized Category Discovery (GCD) is a recently proposed open-world problem
that aims to automatically discover and cluster based on partially labeled data. The …
that aims to automatically discover and cluster based on partially labeled data. The …