Towards open vocabulary learning: A survey
In the field of visual scene understanding, deep neural networks have made impressive
advancements in various core tasks like segmentation, tracking, and detection. However …
A survey on open-vocabulary detection and segmentation: Past, present, and future
As the most fundamental scene understanding tasks, object detection and segmentation
have made tremendous progress in the deep learning era. Due to the expensive manual …
UniVS: Unified and universal video segmentation with prompts as queries
Despite the recent advances in unified image segmentation (IS), developing a unified video
segmentation (VS) model remains a challenge. This is mainly because generic category …
Unified embedding alignment for open-vocabulary video instance segmentation
Open-Vocabulary Video Instance Segmentation (VIS) is attracting increasing
attention due to its ability to segment and track arbitrary objects. However, the recent Open …
TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning
Multimodal fusion, leveraging data like vision and language, is rapidly gaining traction. This
enriched data representation improves performance across various tasks. Existing methods …
PanoVOS: Bridging non-panoramic and panoramic views with transformer for video segmentation
Panoramic videos contain richer spatial information and have attracted tremendous amounts
of attention due to their exceptional experience in some fields such as autonomous driving …
Learning the What and How of Annotation in Video Object Segmentation
Video Object Segmentation (VOS) is crucial for several applications, from video
editing to video data generation. Training a VOS model requires an abundance of manually …
X-Prompt: Multi-modal visual prompt for video object segmentation
Multi-modal Video Object Segmentation (VOS), including RGB-Thermal, RGB-Depth, and
RGB-Event, has garnered attention due to its capability to address challenging scenarios …
Instance Brownian Bridge as Texts for Open-vocabulary Video Instance Segmentation
Temporally locating objects with arbitrary class texts is the primary pursuit of open-
vocabulary Video Instance Segmentation (VIS). Because of the insufficient vocabulary of …
Towards Decision-based Sparse Attacks on Video Recognition
Recent studies indicate that sparse attacks threaten the security of deep learning models,
which modify only a small set of pixels in the input based on the l0 norm constraint. While …