A survey on graph neural networks and graph transformers in computer vision: A task-oriented perspective
Graph Neural Networks (GNNs) have gained momentum in graph representation learning
and boosted the state of the art in a variety of areas, such as data mining (eg, social network …
and boosted the state of the art in a variety of areas, such as data mining (eg, social network …
Winclip: Zero-/few-shot anomaly classification and segmentation
Visual anomaly classification and segmentation are vital for automating industrial quality
inspection. The focus of prior research in the field has been on training custom models for …
inspection. The focus of prior research in the field has been on training custom models for …
Introducing language guidance in prompt-based continual learning
Continual Learning aims to learn a single model on a sequence of tasks without having
access to data from previous tasks. The biggest challenge in the domain still remains …
access to data from previous tasks. The biggest challenge in the domain still remains …
I2mvformer: Large language model generated multi-view document supervision for zero-shot image classification
Recent works have shown that unstructured text (documents) from online sources can serve
as useful auxiliary information for zero-shot image classification. However, these methods …
as useful auxiliary information for zero-shot image classification. However, these methods …
Chop & learn: Recognizing and generating object-state compositions
Recognizing and generating object-state compositions has been a challenging task,
especially when generalizing to unseen compositions. In this paper, we study the task of …
especially when generalizing to unseen compositions. In this paper, we study the task of …
Disentangling visual embeddings for attributes and objects
We study the problem of compositional zero-shot learning for object-attribute recognition.
Prior works use visual features extracted with a backbone network, pre-trained for object …
Prior works use visual features extracted with a backbone network, pre-trained for object …
I2dformer: Learning image to document attention for zero-shot image classification
Despite the tremendous progress in zero-shot learning (ZSL), the majority of existing
methods still rely on human-annotated attributes, which are difficult to annotate and scale …
methods still rely on human-annotated attributes, which are difficult to annotate and scale …
Decomposed soft prompt guided fusion enhancing for compositional zero-shot learning
Abstract Compositional Zero-Shot Learning (CZSL) aims to recognize novel concepts
formed by known states and objects during training. Existing methods either learn the …
formed by known states and objects during training. Existing methods either learn the …
Tsca: On the semantic consistency alignment via conditional transport for compositional zero-shot learning
Compositional Zero-Shot Learning (CZSL) aims to recognize novel\textit {state-object}
compositions by leveraging the shared knowledge of their primitive components. Despite …
compositions by leveraging the shared knowledge of their primitive components. Despite …
Learning attention as disentangler for compositional zero-shot learning
Compositional zero-shot learning (CZSL) aims at learning visual concepts (ie, attributes and
objects) from seen compositions and combining concept knowledge into unseen …
objects) from seen compositions and combining concept knowledge into unseen …