Segment anything is not always perfect: An investigation of sam on different real-world applications
Abstract Recently, Meta AI Research approaches a general, promptable segment anything
model (SAM) pre-trained on an unprecedentedly large segmentation dataset (SA-1B) …
model (SAM) pre-trained on an unprecedentedly large segmentation dataset (SA-1B) …
Zoom in and out: A mixed-scale triplet network for camouflaged object detection
The recently proposed camouflaged object detection (COD) attempts to segment objects that
are visually blended into their surroundings, which is extremely complex and difficult in real …
are visually blended into their surroundings, which is extremely complex and difficult in real …
HRTransNet: HRFormer-driven two-modality salient object detection
The High-Resolution Transformer (HRFormer) can maintain high-resolution representation
and share global receptive fields. It is friendly towards salient object detection (SOD) in …
and share global receptive fields. It is friendly towards salient object detection (SOD) in …
Multispectral video semantic segmentation: A benchmark dataset and baseline
Robust and reliable semantic segmentation in complex scenes is crucial for many real-life
applications such as autonomous safe driving and nighttime rescue. In most approaches, it …
applications such as autonomous safe driving and nighttime rescue. In most approaches, it …
DVSOD: RGB-D video salient object detection
Salient object detection (SOD) aims to identify standout elements in a scene, with recent
advancements primarily focused on integrating depth data (RGB-D) or temporal data from …
advancements primarily focused on integrating depth data (RGB-D) or temporal data from …
Learning content-enhanced mask transformer for domain generalized urban-scene segmentation
Domain-generalized urban-scene semantic segmentation (USSS) aims to learn generalized
semantic predictions across diverse urban-scene styles. Unlike generic domain gap …
semantic predictions across diverse urban-scene styles. Unlike generic domain gap …
Self-supervised pretraining for RGB-D salient object detection
Abstract Existing CNNs-Based RGB-D salient object detection (SOD) networks are all
required to be pretrained on the ImageNet to learn the hierarchy features which helps …
required to be pretrained on the ImageNet to learn the hierarchy features which helps …
Delving into calibrated depth for accurate rgb-d salient object detection
Recent years have witnessed growing interests in RGB-D Salient Object Detection (SOD),
benefiting from the ample spatial layout cues embedded in depth maps to help SOD models …
benefiting from the ample spatial layout cues embedded in depth maps to help SOD models …
All grains, one scheme (AGOS): Learning multigrain instance representation for aerial scene classification
Aerial scene classification remains challenging as: 1) the size of key objects in determining
the scene scheme varies greatly and 2) many objects irrelevant to the scene scheme are …
the scene scheme varies greatly and 2) many objects irrelevant to the scene scheme are …
Mutual information regularization for weakly-supervised RGB-D salient object detection
In this paper, we present a weakly-supervised RGB-D salient object detection model via
scribble supervision. Specifically, as a multimodal learning task, we focus on effective …
scribble supervision. Specifically, as a multimodal learning task, we focus on effective …