- Academic Search

Spara Citera Citerat av 397 Relaterade artiklar Alla 2 versionerna Se som HTML-version

Sam 2: Segment anything in images and videos

N Ravi, V Gabeur, YT Hu, R Hu, C Ryali, T Ma… - arxiv preprint arxiv …, 2024 - arxiv.org

We present Segment Anything Model 2 (SAM 2), a foundation model towards solving
promptable visual segmentation in images and videos. We build a data engine, which …

Spara Citera Citerat av 166 Relaterade artiklar Alla 5 versionerna Se som HTML-version

Universal instance perception as object discovery and retrieval

B Yan, Y Jiang, J Wu, D Wang, P Luo… - Proceedings of the …, 2023 - openaccess.thecvf.com

All instance perception tasks aim at finding certain objects specified by some queries such
as category names, language expressions, and target annotations, but this complete field …

Spara Citera Citerat av 402 Relaterade artiklar Alla 8 versionerna

Xmem: Long-term video object segmentation with an atkinson-shiffrin memory model

HK Cheng, AG Schwing - European Conference on Computer Vision, 2022 - Springer

We present XMem, a video object segmentation architecture for long videos with unified
feature memory stores inspired by the Atkinson-Shiffrin memory model. Prior work on video …

Spara Citera Citerat av 221 Relaterade artiklar Alla 2 versionerna Se som HTML-version

Segment and track anything

Y Cheng, L Li, Y Xu, X Li, Z Yang, W Wang… - arxiv preprint arxiv …, 2023 - arxiv.org

This report presents a framework called Segment And Track Anything (SAMTrack) that
allows users to precisely and effectively segment and track any object in a video …

Spara Citera Citerat av 119 Relaterade artiklar Alla 7 versionerna Se som HTML-version

MOSE: A new dataset for video object segmentation in complex scenes

H Ding, C Liu, S He, X Jiang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Video object segmentation (VOS) aims at segmenting a particular object throughout the
entire video clip sequence. The state-of-the-art VOS methods have achieved excellent …

Spara Citera Citerat av 44 Relaterade artiklar Alla 7 versionerna

Visual semantic segmentation based on few/zero-shot learning: An overview

W Ren, Y Tang, Q Sun, C Zhao… - IEEE/CAA Journal of …, 2023 - ieeexplore.ieee.org

Visual semantic segmentation aims at separating a visual sample into diverse blocks with
specific semantic attributes and identifying the category for each block, and it plays a crucial …

Spara Citera Citerat av 343 Relaterade artiklar Alla 10 versionerna Se som HTML-version

Lavt: Language-aware vision transformer for referring image segmentation

Z Yang, J Wang, Y Tang, K Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com

Referring image segmentation is a fundamental vision-language task that aims to segment
out an object referred to by a natural language expression from an image. One of the key …

Spara Citera Citerat av 108 Relaterade artiklar Alla 6 versionerna Se som HTML-version

Dropmae: Masked autoencoders with spatial-attention dropout for tracking tasks

Q Wu, T Yang, Z Liu, B Wu, Y Shan… - Proceedings of the …, 2023 - openaccess.thecvf.com

In this paper, we study masked autoencoder (MAE) pretraining on videos for matching-
based downstream tasks, including visual object tracking (VOT) and video object …