Promptad: Learning prompts with only normal samples for few-shot anomaly detection
The vision-language model has brought great improvement to few-shot industrial anomaly
detection which usually needs to design of hundreds of prompts through prompt …
detection which usually needs to design of hundreds of prompts through prompt …
Driveworld: 4d pre-trained scene understanding via world models for autonomous driving
Vision-centric autonomous driving has recently raised wide attention due to its lower cost.
Pre-training is essential for extracting a universal representation. However current vision …
Pre-training is essential for extracting a universal representation. However current vision …
Prompt-driven referring image segmentation with instance contrasting
Referring image segmentation (RIS) aims to segment the target referent described by
natural language. Recently large-scale pre-trained models eg CLIP and SAM have been …
natural language. Recently large-scale pre-trained models eg CLIP and SAM have been …
Simple semantic-aided few-shot learning
Learning from a limited amount of data namely Few-Shot Learning stands out as a
challenging computer vision task. Several works exploit semantics and design complicated …
challenging computer vision task. Several works exploit semantics and design complicated …
Pathology-knowledge enhanced multi-instance prompt learning for few-shot whole slide image classification
Current multi-instance learning algorithms for pathology image analysis often require a
substantial number of Whole Slide Images for effective training but exhibit suboptimal …
substantial number of Whole Slide Images for effective training but exhibit suboptimal …
Prompt-and-transfer: Dynamic class-aware enhancement for few-shot segmentation
For more efficient generalization to unseen domains (classes), most Few-shot Segmentation
(FSS) would directly exploit pre-trained encoders and only fine-tune the decoder, especially …
(FSS) would directly exploit pre-trained encoders and only fine-tune the decoder, especially …
Bimodal semantic fusion prototypical network for few-shot classification
Few-shot classification learns from a small number of image samples to recognize unseen
images. Recent few-shot learning exploits auxiliary text information, such as class labels …
images. Recent few-shot learning exploits auxiliary text information, such as class labels …
Scattering attribute embedded network for few-shot sar atr
J Qin, B Zou, Y Chen, H Li… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Restricted by the deficient training samples, deep learning-based automatic target
recognition (ATR) methods for synthetic aperture radar (SAR) are prone to performance …
recognition (ATR) methods for synthetic aperture radar (SAR) are prone to performance …
Envisioning class entity reasoning by large language models for few-shot learning
Few-shot learning (FSL) aims to recognize new concepts using a limited number of visual
samples. Existing approaches attempt to incorporate semantic information into the limited …
samples. Existing approaches attempt to incorporate semantic information into the limited …
Concon-chi: Concept-context chimera benchmark for personalized vision-language tasks
Abstract While recent Vision-Language (VL) models excel at open-vocabulary tasks it is
unclear how to use them with specific or uncommon concepts. Personalized Text-to-Image …
unclear how to use them with specific or uncommon concepts. Personalized Text-to-Image …