Universal instance perception as object discovery and retrieval

B Yan, Y Jiang, J Wu, D Wang, P Luo… - Proceedings of the …, 2023 - openaccess.thecvf.com
All instance perception tasks aim at finding certain objects specified by some queries such
as category names, language expressions, and target annotations, but this complete field …

Transformer-based visual segmentation: A survey

X Li, H Ding, H Yuan, W Zhang, J Pang… - IEEE transactions on …, 2024 - ieeexplore.ieee.org
Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …

MeViS: A large-scale benchmark for video segmentation with motion expressions

H Ding, C Liu, S He, X Jiang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
This paper strives for motion expressions guided video segmentation, which focuses on
segmenting objects in video content based on a sentence describing the motion of the …

General object foundation model for images and videos at scale

J Wu, Y Jiang, Q Liu, Z Yuan… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present GLEE in this work, an object-level foundation model for locating and identifying
objects in images and videos. Through a unified framework, GLEE accomplishes detection …

Decoupling static and hierarchical motion perception for referring video segmentation

S He, H Ding - Proceedings of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Referring video segmentation relies on natural language expressions to identify and
segment objects, often emphasizing motion clues. Previous works treat a sentence as a …

Tube-Link: A flexible cross tube framework for universal video segmentation

X Li, H Yuan, W Zhang, G Cheng… - Proceedings of the …, 2023 - openaccess.thecvf.com
Video segmentation aims to segment and track every pixel in diverse scenarios accurately.
In this paper, we present Tube-Link, a versatile framework that addresses multiple core tasks …

DVIS: Decoupled video instance segmentation framework

T Zhang, X Tian, Y Wu, S Ji, X Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Video instance segmentation (VIS) is a critical task with diverse applications, including
autonomous driving and video editing. Existing methods often underperform on complex …

Spectrum-guided multi-granularity referring video object segmentation

B Miao, M Bennamoun, Y Gao… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Current referring video object segmentation (R-VOS) techniques extract conditional kernels
from encoded (low-resolution) vision-language features to segment the decoded high …

CTVIS: Consistent training for online video instance segmentation

K Ying, Q Zhong, W Mao, Z Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
The discrimination of instance embeddings plays a vital role in associating instances across
time for online video instance segmentation (VIS). Instance embedding learning is directly …

SOC: Semantic-assisted object cluster for referring video object segmentation

Z Luo, Y Xiao, Y Liu, S Li, Y Wang… - Advances in …, 2023 - proceedings.neurips.cc
This paper studies referring video object segmentation (RVOS) by boosting video-level
visual-linguistic alignment. Recent approaches model the RVOS task as a sequence …