[HTML][HTML] Review of large vision models and visual prompt engineering

J Wang, Z Liu, L Zhao, Z Wu, C Ma, S Yu, H Dai… - Meta-Radiology, 2023 - Elsevier
Visual prompt engineering is a fundamental methodology in the field of visual and image
artificial general intelligence. As the development of large vision models progresses, the …

A comprehensive survey of few-shot learning: Evolution, applications, challenges, and opportunities

Y Song, T Wang, P Cai, SK Mondal… - ACM Computing Surveys, 2023 - dl.acm.org
Few-shot learning (FSL) has emerged as an effective learning method and shows great
potential. Despite the recent creative works in tackling FSL tasks, learning valid information …

Voxformer: Sparse voxel transformer for camera-based 3d semantic scene completion

Y Li, Z Yu, C Choy, C **ao, JM Alvarez… - Proceedings of the …, 2023 - openaccess.thecvf.com
Humans can easily imagine the complete 3D geometry of occluded objects and scenes. This
appealing ability is vital for recognition and understanding. To enable such capability in AI …

Rethinking semantic segmentation: A prototype view

T Zhou, W Wang, E Konukoglu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Prevalent semantic segmentation solutions, despite their different network designs (FCN
based or attention based) and mask decoding strategies (parametric softmax based or pixel …

Image segmentation using text and image prompts

T Lüddecke, A Ecker - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
Image segmentation is usually addressed by training a model for a fixed set of object
classes. Incorporating additional classes or more complex queries later is expensive as it …

Language-driven semantic segmentation

B Li, KQ Weinberger, S Belongie, V Koltun… - arxiv preprint arxiv …, 2022 - arxiv.org
We present LSeg, a novel model for language-driven semantic image segmentation. LSeg
uses a text encoder to compute embeddings of descriptive input labels (eg," grass" or" …

Learning what not to segment: A new perspective on few-shot segmentation

C Lang, G Cheng, B Tu, J Han - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Recently few-shot segmentation (FSS) has been extensively developed. Most previous
works strive to achieve generalization through the meta-learning framework derived from …

Universeg: Universal medical image segmentation

VI Butoi, JJG Ortiz, T Ma, MR Sabuncu… - Proceedings of the …, 2023 - openaccess.thecvf.com
While deep learning models have become the predominant method for medical image
segmentation, they are typically not capable of generalizing to unseen segmentation tasks …

Hierarchical dense correlation distillation for few-shot segmentation

B Peng, Z Tian, X Wu, C Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Few-shot semantic segmentation (FSS) aims to form class-agnostic models segmenting
unseen classes with only a handful of annotations. Previous methods limited to the semantic …

Real-time vehicle detection based on improved yolo v5

Y Zhang, Z Guo, J Wu, Y Tian, H Tang, X Guo - Sustainability, 2022 - mdpi.com
To reduce the false detection rate of vehicle targets caused by occlusion, an improved
method of vehicle detection in different traffic scenarios based on an improved YOLO v5 …