- Academic Search

N Ravi, V Gabeur, YT Hu, R Hu, C Ryali, T Ma… - arxiv preprint arxiv …, 2024 - arxiv.org

We present Segment Anything Model 2 (SAM 2), a foundation model towards solving
promptable visual segmentation in images and videos. We build a data engine, which …

Enregistrer Citer Cité 385 fois Autres articles Version HTML

[Free GPT-4]

[PDF] thecvf.com

Anydoor: Zero-shot object-level image customization

X Chen, L Huang, Y Liu, Y Shen… - Proceedings of the …, 2024 - openaccess.thecvf.com

This work presents AnyDoor a diffusion-based image generator with the power to teleport
target objects to new scenes at user-specified locations with desired shapes. Instead of …

Enregistrer Citer Cité 212 fois Autres articles Les 3 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] thecvf.com

Sequential modeling enables scalable learning for large vision models

Y Bai, X Geng, K Mangalam, A Bar… - Proceedings of the …, 2024 - openaccess.thecvf.com

We introduce a novel sequential modeling approach which enables learning a Large Vision
Model (LVM) without making use of any linguistic data. To do this we define a common …

Enregistrer Citer Cité 142 fois Autres articles Les 3 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] thecvf.com

Tracking anything with decoupled video segmentation

HK Cheng, SW Oh, B Price… - Proceedings of the …, 2023 - openaccess.thecvf.com

Training data for video segmentation are expensive to annotate. This impedes extensions of
end-to-end algorithms to new video segmentation tasks, especially in large-vocabulary …

Enregistrer Citer Cité 140 fois Autres articles Les 7 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] ieee.org

Transformer-based visual segmentation: A survey

X Li, H Ding, H Yuan, W Zhang, J Pang… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …

Enregistrer Citer Cité 119 fois Autres articles Les 3 versions Free GPT-4

[Free GPT-4]

[PDF] thecvf.com

MOSE: A new dataset for video object segmentation in complex scenes

H Ding, C Liu, S He, X Jiang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Video object segmentation (VOS) aims at segmenting a particular object throughout the
entire video clip sequence. The state-of-the-art VOS methods have achieved excellent …

Enregistrer Citer Cité 119 fois Autres articles Les 7 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] thecvf.com

OMG-Seg: Is one model good enough for all segmentation?

X Li, H Yuan, W Li, H Ding, S Wu… - Proceedings of the …, 2024 - openaccess.thecvf.com

In this work we address various segmentation tasks each traditionally tackled by distinct or
partially unified models. We propose OMG-Seg One Model that is Good enough to efficiently …

Enregistrer Citer Cité 44 fois Autres articles Les 3 versions Free GPT-4 Version HTML

Draganything: Motion control for anything using entity representation

W Wu, Z Li, Y Gu, R Zhao, Y He, DJ Zhang… - … on Computer Vision, 2024 - Springer

We introduce DragAnything, which utilizes a entity representation to achieve motion control
for any object in controllable video generation. Comparison to existing motion control …

Enregistrer Citer Cité 31 fois Autres articles Les 2 versions Free GPT-4

[Free GPT-4]

[PDF] thecvf.com

Tube-Link: A flexible cross tube framework for universal video segmentation

X Li, H Yuan, W Zhang, G Cheng… - Proceedings of the …, 2023 - openaccess.thecvf.com

Video segmentation aims to segment and track every pixel in diverse scenarios accurately.
In this paper, we present Tube-Link, a versatile framework that addresses multiple core tasks …

Enregistrer Citer Cité 49 fois Autres articles Les 5 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] thecvf.com

Video k-net: A simple, strong, and unified baseline for video segmentation

X Li, W Zhang, J Pang, K Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com

This paper presents Video K-Net, a simple, strong, and unified framework for fully end-to-
end video panoptic segmentation. The method is built upon K-Net, a method that unifies …

Enregistrer Citer Cité 100 fois Autres articles Les 6 versions Free GPT-4 Version HTML

Créer l'alerte

Citer

Recherche avancée

Enregistré dans Ma bibliothèque

Large-scale video panoptic segmentation in the wild: A benchmark

Sam 2: Segment anything in images and videos

Anydoor: Zero-shot object-level image customization

Sequential modeling enables scalable learning for large vision models

Tracking anything with decoupled video segmentation

Transformer-based visual segmentation: A survey

MOSE: A new dataset for video object segmentation in complex scenes

OMG-Seg: Is one model good enough for all segmentation?

Draganything: Motion control for anything using entity representation

Tube-Link: A flexible cross tube framework for universal video segmentation

Video k-net: A simple, strong, and unified baseline for video segmentation