Segment anything is not always perfect: An investigation of sam on different real-world applications
Abstract Recently, Meta AI Research approaches a general, promptable segment anything
model (SAM) pre-trained on an unprecedentedly large segmentation dataset (SA-1B) …
model (SAM) pre-trained on an unprecedentedly large segmentation dataset (SA-1B) …
Fast segment anything
The recently proposed segment anything model (SAM) has made a significant influence in
many computer vision tasks. It is becoming a foundation step for many high-level tasks, like …
many computer vision tasks. It is becoming a foundation step for many high-level tasks, like …
Proposal-based multiple instance learning for weakly-supervised temporal action localization
Weakly-supervised temporal action localization aims to localize and recognize actions in
untrimmed videos with only video-level category labels during training. Without instance …
untrimmed videos with only video-level category labels during training. Without instance …
Vectorized evidential learning for weakly-supervised temporal action localization
With the explosive growth of videos, weakly-supervised temporal action localization (WS-
TAL) task has become a promising research direction in pattern analysis and machine …
TAL) task has become a promising research direction in pattern analysis and machine …
Multispectral video semantic segmentation: A benchmark dataset and baseline
Robust and reliable semantic segmentation in complex scenes is crucial for many real-life
applications such as autonomous safe driving and nighttime rescue. In most approaches, it …
applications such as autonomous safe driving and nighttime rescue. In most approaches, it …
DVSOD: RGB-D video salient object detection
Salient object detection (SOD) aims to identify standout elements in a scene, with recent
advancements primarily focused on integrating depth data (RGB-D) or temporal data from …
advancements primarily focused on integrating depth data (RGB-D) or temporal data from …
Boosting weakly-supervised temporal action localization with text information
Due to the lack of temporal annotation, current Weakly-supervised Temporal Action
Localization (WTAL) methods are generally stuck into over-complete or incomplete …
Localization (WTAL) methods are generally stuck into over-complete or incomplete …
Distilling vision-language pre-training to collaborate with weakly-supervised temporal action localization
Weakly-supervised temporal action localization (WTAL) learns to detect and classify action
instances with only category labels. Most methods widely adopt the off-the-shelf …
instances with only category labels. Most methods widely adopt the off-the-shelf …
DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization
Weakly-supervised temporal action localization (WTAL) is a practical yet challenging task.
Due to large-scale datasets, most existing methods use a network pretrained in other …
Due to large-scale datasets, most existing methods use a network pretrained in other …
Actionness inconsistency-guided contrastive learning for weakly-supervised temporal action localization
Weakly-supervised temporal action localization (WTAL) aims to detect action instances
given only video-level labels. To address the challenge, recent methods commonly employ …
given only video-level labels. To address the challenge, recent methods commonly employ …