OMG-Seg: Is one model good enough for all segmentation?
In this work we address various segmentation tasks each traditionally tackled by distinct or
partially unified models. We propose OMG-Seg One Model that is Good enough to efficiently …
partially unified models. We propose OMG-Seg One Model that is Good enough to efficiently …
Towards open vocabulary learning: A survey
In the field of visual scene understanding, deep neural networks have made impressive
advancements in various core tasks like segmentation, tracking, and detection. However …
advancements in various core tasks like segmentation, tracking, and detection. However …
Reference twice: A simple and unified baseline for few-shot instance segmentation
Few-Shot Instance Segmentation (FSIS) requires detecting and segmenting novel classes
with limited support examples. Existing methods based on Region Proposal Networks …
with limited support examples. Existing methods based on Region Proposal Networks …
Clim: Contrastive language-image mosaic for region representation
Detecting objects accurately from a large or open vocabulary necessitates the vision-
language alignment on region representations. However, learning such a region-text …
language alignment on region representations. However, learning such a region-text …
[HTML][HTML] Ov-vg: A benchmark for open-vocabulary visual grounding
Open-vocabulary learning has emerged as a cutting-edge research area, particularly in light
of the widespread adoption of vision-based foundational models. Its primary objective is to …
of the widespread adoption of vision-based foundational models. Its primary objective is to …
Rethinking evaluation metrics of open-vocabulary segmentaion
In this paper, we highlight a problem of evaluation metrics adopted in the open-vocabulary
segmentation. That is, the evaluation process still heavily relies on closed-set metrics on …
segmentation. That is, the evaluation process still heavily relies on closed-set metrics on …
Ov-dquo: Open-vocabulary detr with denoising text query training and open-world unknown objects supervision
J Wang, B Chen, B Kang, Y Li, YC Chen… - arxiv preprint arxiv …, 2024 - arxiv.org
Open-vocabulary detection aims to detect objects from novel categories beyond the base
categories on which the detector is trained. However, existing open-vocabulary detectors …
categories on which the detector is trained. However, existing open-vocabulary detectors …
VG4D: Vision-Language Model Goes 4D Video Recognition
Understanding the real world through point cloud video is a crucial aspect of robotics and
autonomous driving systems. However, prevailing methods for 4D point cloud recognition …
autonomous driving systems. However, prevailing methods for 4D point cloud recognition …
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection
Open-vocabulary detection (OVD) aims to detect objects beyond a predefined set of
categories. As a pioneering model incorporating the YOLO series into OVD, YOLO-World is …
categories. As a pioneering model incorporating the YOLO series into OVD, YOLO-World is …
Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection
Enabling models to recognize vast open-world categories has been a longstanding pursuit
in object detection. By leveraging the generalization capabilities of vision-language models …
in object detection. By leveraging the generalization capabilities of vision-language models …