Deep learning for video object segmentation: a review
As one of the fundamental problems in the field of video understanding, video object
segmentation aims at segmenting objects of interest throughout the given video sequence …
segmentation aims at segmenting objects of interest throughout the given video sequence …
Water body classification from high-resolution optical remote sensing imagery: Achievements and perspectives
Water body classification from high-resolution optical remote sensing (RS) images, aiming at
classifying whether each pixel of the image is water or not, has become a hot issue in the …
classifying whether each pixel of the image is water or not, has become a hot issue in the …
Segment anything in high quality
Abstract The recent Segment Anything Model (SAM) represents a big leap in scaling up
segmentation models, allowing for powerful zero-shot capabilities and flexible prompting …
segmentation models, allowing for powerful zero-shot capabilities and flexible prompting …
Xmem: Long-term video object segmentation with an atkinson-shiffrin memory model
We present XMem, a video object segmentation architecture for long videos with unified
feature memory stores inspired by the Atkinson-Shiffrin memory model. Prior work on video …
feature memory stores inspired by the Atkinson-Shiffrin memory model. Prior work on video …
Tracking anything with decoupled video segmentation
Training data for video segmentation are expensive to annotate. This impedes extensions of
end-to-end algorithms to new video segmentation tasks, especially in large-vocabulary …
end-to-end algorithms to new video segmentation tasks, especially in large-vocabulary …
Transformer-based visual segmentation: A survey
Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …
segments or groups. This technique has numerous real-world applications, such as …
St++: Make self-training work better for semi-supervised semantic segmentation
Self-training via pseudo labeling is a conventional, simple, and popular pipeline to leverage
unlabeled data. In this work, we first construct a strong baseline of self-training (namely ST) …
unlabeled data. In this work, we first construct a strong baseline of self-training (namely ST) …
Cost aggregation with 4d convolutional swin transformer for few-shot segmentation
This paper presents a novel cost aggregation network, called Volumetric Aggregation with
Transformers (VAT), for few-shot segmentation. The use of transformers can benefit …
Transformers (VAT), for few-shot segmentation. The use of transformers can benefit …
Putting the object back into video object segmentation
We present Cutie a video object segmentation (VOS) network with object-level memory
reading which puts the object representation from memory back into the video object …
reading which puts the object representation from memory back into the video object …
Matting anything
In this paper we propose the Matting Anything Model (MAM) an efficient and versatile
framework for estimating the alpha matte of any instance in an image with flexible and …
framework for estimating the alpha matte of any instance in an image with flexible and …