Convolutions die hard: Open-vocabulary segmentation with single frozen convolutional clip

Q Yu, J He, X Deng, X Shen… - Advances in Neural …, 2023 - proceedings.neurips.cc
Open-vocabulary segmentation is a challenging task requiring segmenting and recognizing
objects from an open set of categories in diverse environments. One way to address this …

Towards open vocabulary learning: A survey

J Wu, X Li, S Xu, H Yuan, H Ding… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
In the field of visual scene understanding, deep neural networks have made impressive
advancements in various core tasks like segmentation, tracking, and detection. However …

A survey on open-vocabulary detection and segmentation: Past, present, and future

C Zhu, L Chen - IEEE Transactions on Pattern Analysis and …, 2024 - ieeexplore.ieee.org
As the most fundamental scene understanding tasks, object detection and segmentation
have made tremendous progress in deep learning era. Due to the expensive manual …

Sed: A simple encoder-decoder for open-vocabulary semantic segmentation

B **e, J Cao, J **e, FS Khan… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Open-vocabulary semantic segmentation strives to distinguish pixels into different semantic
groups from an open set of categories. Most existing methods explore utilizing pre-trained …

Explore the potential of clip for training-free open vocabulary semantic segmentation

T Shao, Z Tian, H Zhao, J Su - European Conference on Computer Vision, 2024 - Springer
CLIP, as a vision-language model, has significantly advanced Open-Vocabulary Semantic
Segmentation (OVSS) with its zero-shot capabilities. Despite its success, its application to …

Ov-parts: Towards open-vocabulary part segmentation

M Wei, X Yue, W Zhang, S Kong… - Advances in Neural …, 2024 - proceedings.neurips.cc
Segmenting and recognizing diverse object parts is a crucial ability in applications spanning
various computer vision and robotic tasks. While significant progress has been made in …

What a mess: Multi-domain evaluation of zero-shot semantic segmentation

B Blumenstiel, J Jakubik, H Kühne… - Advances in Neural …, 2024 - proceedings.neurips.cc
While semantic segmentation has seen tremendous improvements in the past, there are still
significant labeling efforts necessary and the problem of limited generalization to classes …

Silc: Improving vision language pretraining with self-distillation

MF Naeem, Y **an, X Zhai, L Hoyer, L Van Gool… - … on Computer Vision, 2024 - Springer
Image-Text pretraining on web-scale image caption datasets has become the default recipe
for open vocabulary classification and retrieval models thanks to the success of CLIP and its …

SemiVL: semi-supervised semantic segmentation with vision-language guidance

L Hoyer, DJ Tan, MF Naeem, L Van Gool… - European Conference on …, 2024 - Springer
In semi-supervised semantic segmentation, a model is trained with a limited number of
labeled images along with a large corpus of unlabeled images to reduce the high annotation …

Open-vocabulary semantic segmentation via attribute decomposition-aggregation

C Ma, Y Yuhuan, C Ju, F Zhang… - Advances in Neural …, 2024 - proceedings.neurips.cc
Open-vocabulary semantic segmentation is a challenging task that requires segmenting
novel object categories at inference time. Recent works explore vision-language pre-training …