Deep learning for video object segmentation: a review

M Gao, F Zheng, JJQ Yu, C Shan, G Ding… - Artificial Intelligence …, 2023 - Springer
As one of the fundamental problems in the field of video understanding, video object
segmentation aims at segmenting objects of interest throughout the given video sequence …

Vision transformers for dense prediction: A survey

S Zuo, Y **ao, X Chang, X Wang - Knowledge-based systems, 2022 - Elsevier
Transformers have demonstrated impressive expressiveness and transfer capability in
computer vision fields. Dense prediction is a fundamental problem in computer vision that is …

Sam 2: Segment anything in images and videos

N Ravi, V Gabeur, YT Hu, R Hu, C Ryali, T Ma… - arxiv preprint arxiv …, 2024 - arxiv.org
We present Segment Anything Model 2 (SAM 2), a foundation model towards solving
promptable visual segmentation in images and videos. We build a data engine, which …

Tracking anything with decoupled video segmentation

HK Cheng, SW Oh, B Price… - Proceedings of the …, 2023 - openaccess.thecvf.com
Training data for video segmentation are expensive to annotate. This impedes extensions of
end-to-end algorithms to new video segmentation tasks, especially in large-vocabulary …

Xmem: Long-term video object segmentation with an atkinson-shiffrin memory model

HK Cheng, AG Schwing - European Conference on Computer Vision, 2022 - Springer
We present XMem, a video object segmentation architecture for long videos with unified
feature memory stores inspired by the Atkinson-Shiffrin memory model. Prior work on video …

Track anything: Segment anything meets videos

J Yang, M Gao, Z Li, S Gao, F Wang… - arxiv preprint arxiv …, 2023 - arxiv.org
Recently, the Segment Anything Model (SAM) gains lots of attention rapidly due to its
impressive segmentation performance on images. Regarding its strong ability on image …

Segment and track anything

Y Cheng, L Li, Y Xu, X Li, Z Yang, W Wang… - arxiv preprint arxiv …, 2023 - arxiv.org
This report presents a framework called Segment And Track Anything (SAMTrack) that
allows users to precisely and effectively segment and track any object in a video …

Aiatrack: Attention in attention for transformer visual tracking

S Gao, C Zhou, C Ma, X Wang, J Yuan - European conference on …, 2022 - Springer
Transformer trackers have achieved impressive advancements recently, where the attention
mechanism plays an important role. However, the independent correlation computation in …

Transformer-based visual segmentation: A survey

X Li, H Ding, H Yuan, W Zhang, J Pang… - IEEE transactions on …, 2024 - ieeexplore.ieee.org
Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …

Putting the object back into video object segmentation

HK Cheng, SW Oh, B Price, JY Lee… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present Cutie a video object segmentation (VOS) network with object-level memory
reading which puts the object representation from memory back into the video object …