[HTML][HTML] Review of large vision models and visual prompt engineering
Visual prompt engineering is a fundamental methodology in the field of visual and image
artificial general intelligence. As the development of large vision models progresses, the …
artificial general intelligence. As the development of large vision models progresses, the …
A comprehensive survey of few-shot learning: Evolution, applications, challenges, and opportunities
Few-shot learning (FSL) has emerged as an effective learning method and shows great
potential. Despite the recent creative works in tackling FSL tasks, learning valid information …
potential. Despite the recent creative works in tackling FSL tasks, learning valid information …
Voxformer: Sparse voxel transformer for camera-based 3d semantic scene completion
Humans can easily imagine the complete 3D geometry of occluded objects and scenes. This
appealing ability is vital for recognition and understanding. To enable such capability in AI …
appealing ability is vital for recognition and understanding. To enable such capability in AI …
Rethinking semantic segmentation: A prototype view
Prevalent semantic segmentation solutions, despite their different network designs (FCN
based or attention based) and mask decoding strategies (parametric softmax based or pixel …
based or attention based) and mask decoding strategies (parametric softmax based or pixel …
Image segmentation using text and image prompts
Image segmentation is usually addressed by training a model for a fixed set of object
classes. Incorporating additional classes or more complex queries later is expensive as it …
classes. Incorporating additional classes or more complex queries later is expensive as it …
Language-driven semantic segmentation
We present LSeg, a novel model for language-driven semantic image segmentation. LSeg
uses a text encoder to compute embeddings of descriptive input labels (eg," grass" or" …
uses a text encoder to compute embeddings of descriptive input labels (eg," grass" or" …
Learning what not to segment: A new perspective on few-shot segmentation
Recently few-shot segmentation (FSS) has been extensively developed. Most previous
works strive to achieve generalization through the meta-learning framework derived from …
works strive to achieve generalization through the meta-learning framework derived from …
Universeg: Universal medical image segmentation
While deep learning models have become the predominant method for medical image
segmentation, they are typically not capable of generalizing to unseen segmentation tasks …
segmentation, they are typically not capable of generalizing to unseen segmentation tasks …
Hierarchical dense correlation distillation for few-shot segmentation
Few-shot semantic segmentation (FSS) aims to form class-agnostic models segmenting
unseen classes with only a handful of annotations. Previous methods limited to the semantic …
unseen classes with only a handful of annotations. Previous methods limited to the semantic …
Real-time vehicle detection based on improved yolo v5
Y Zhang, Z Guo, J Wu, Y Tian, H Tang, X Guo - Sustainability, 2022 - mdpi.com
To reduce the false detection rate of vehicle targets caused by occlusion, an improved
method of vehicle detection in different traffic scenarios based on an improved YOLO v5 …
method of vehicle detection in different traffic scenarios based on an improved YOLO v5 …