Promptad: Learning prompts with only normal samples for few-shot anomaly detection

X Li, Z Zhang, X Tan, C Chen, Y Qu… - Proceedings of the …, 2024 - openaccess.thecvf.com
The vision-language model has brought great improvement to few-shot industrial anomaly
detection which usually needs to design of hundreds of prompts through prompt …

Driveworld: 4d pre-trained scene understanding via world models for autonomous driving

C Min, D Zhao, L **ao, J Zhao, X Xu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Vision-centric autonomous driving has recently raised wide attention due to its lower cost.
Pre-training is essential for extracting a universal representation. However current vision …

Prompt-driven referring image segmentation with instance contrasting

C Shang, Z Song, H Qiu, L Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Referring image segmentation (RIS) aims to segment the target referent described by
natural language. Recently large-scale pre-trained models eg CLIP and SAM have been …

Simple semantic-aided few-shot learning

H Zhang, J Xu, S Jiang, Z He - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com
Learning from a limited amount of data namely Few-Shot Learning stands out as a
challenging computer vision task. Several works exploit semantics and design complicated …

Pathology-knowledge enhanced multi-instance prompt learning for few-shot whole slide image classification

L Qu, D Yang, D Huang, Q Guo, R Luo, S Zhang… - … on Computer Vision, 2024 - Springer
Current multi-instance learning algorithms for pathology image analysis often require a
substantial number of Whole Slide Images for effective training but exhibit suboptimal …

Prompt-and-transfer: Dynamic class-aware enhancement for few-shot segmentation

H Bi, Y Feng, W Diao, P Wang, Y Mao… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
For more efficient generalization to unseen domains (classes), most Few-shot Segmentation
(FSS) would directly exploit pre-trained encoders and only fine-tune the decoder, especially …

Bimodal semantic fusion prototypical network for few-shot classification

X Huang, SH Choi - Information Fusion, 2024 - Elsevier
Few-shot classification learns from a small number of image samples to recognize unseen
images. Recent few-shot learning exploits auxiliary text information, such as class labels …

Scattering attribute embedded network for few-shot sar atr

J Qin, B Zou, Y Chen, H Li… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Restricted by the deficient training samples, deep learning-based automatic target
recognition (ATR) methods for synthetic aperture radar (SAR) are prone to performance …

Envisioning class entity reasoning by large language models for few-shot learning

M Liu, F Wu, B Li, Z Lu, Y Yu, X Li - arxiv preprint arxiv:2408.12469, 2024 - arxiv.org
Few-shot learning (FSL) aims to recognize new concepts using a limited number of visual
samples. Existing approaches attempt to incorporate semantic information into the limited …

Concon-chi: Concept-context chimera benchmark for personalized vision-language tasks

A Rosasco, S Berti, G Pasquale… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract While recent Vision-Language (VL) models excel at open-vocabulary tasks it is
unclear how to use them with specific or uncommon concepts. Personalized Text-to-Image …