Segment anything is not always perfect: An investigation of sam on different real-world applications

W Ji, J Li, Q Bi, T Liu, W Li, L Cheng - 2024 - Springer
Abstract Recently, Meta AI Research approaches a general, promptable segment anything
model (SAM) pre-trained on an unprecedentedly large segmentation dataset (SA-1B) …

Fast segment anything

X Zhao, W Ding, Y An, Y Du, T Yu, M Li, M Tang… - arxiv preprint arxiv …, 2023 - arxiv.org
The recently proposed segment anything model (SAM) has made a significant influence in
many computer vision tasks. It is becoming a foundation step for many high-level tasks, like …

Proposal-based multiple instance learning for weakly-supervised temporal action localization

H Ren, W Yang, T Zhang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Weakly-supervised temporal action localization aims to localize and recognize actions in
untrimmed videos with only video-level category labels during training. Without instance …

Vectorized evidential learning for weakly-supervised temporal action localization

J Gao, M Chen, C Xu - IEEE transactions on pattern analysis …, 2023 - ieeexplore.ieee.org
With the explosive growth of videos, weakly-supervised temporal action localization (WS-
TAL) task has become a promising research direction in pattern analysis and machine …

Multispectral video semantic segmentation: A benchmark dataset and baseline

W Ji, J Li, C Bian, Z Zhou, J Zhao… - Proceedings of the …, 2023 - openaccess.thecvf.com
Robust and reliable semantic segmentation in complex scenes is crucial for many real-life
applications such as autonomous safe driving and nighttime rescue. In most approaches, it …

DVSOD: RGB-D video salient object detection

J Li, W Ji, S Wang, W Li - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Salient object detection (SOD) aims to identify standout elements in a scene, with recent
advancements primarily focused on integrating depth data (RGB-D) or temporal data from …

Boosting weakly-supervised temporal action localization with text information

G Li, D Cheng, X Ding, N Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Due to the lack of temporal annotation, current Weakly-supervised Temporal Action
Localization (WTAL) methods are generally stuck into over-complete or incomplete …

Distilling vision-language pre-training to collaborate with weakly-supervised temporal action localization

C Ju, K Zheng, J Liu, P Zhao, Y Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Weakly-supervised temporal action localization (WTAL) learns to detect and classify action
instances with only category labels. Most methods widely adopt the off-the-shelf …

DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization

X Tang, J Fan, C Luo, Z Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Weakly-supervised temporal action localization (WTAL) is a practical yet challenging task.
Due to large-scale datasets, most existing methods use a network pretrained in other …

Actionness inconsistency-guided contrastive learning for weakly-supervised temporal action localization

Z Li, Z Wang, Q Liu - Proceedings of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org
Weakly-supervised temporal action localization (WTAL) aims to detect action instances
given only video-level labels. To address the challenge, recent methods commonly employ …