Universal instance perception as object discovery and retrieval

B Yan, Y Jiang, J Wu, D Wang, P Luo… - Proceedings of the …, 2023 - openaccess.thecvf.com
All instance perception tasks aim at finding certain objects specified by some queries such
as category names, language expressions, and target annotations, but this complete field …

MOSE: A new dataset for video object segmentation in complex scenes

H Ding, C Liu, S He, X Jiang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Video object segmentation (VOS) aims at segmenting a particular object throughout the
entire video clip sequence. The state-of-the-art VOS methods have achieved excellent …

Towards grand unification of object tracking

B Yan, Y Jiang, P Sun, D Wang, Z Yuan, P Luo… - European conference on …, 2022 - Springer
We present a unified method, termed Unicorn, that can simultaneously solve four tracking
problems (SOT, MOT, VOS, MOTS) with a single network using the same model parameters …

End-to-end video instance segmentation with transformers

Y Wang, Z Xu, X Wang, C Shen… - Proceedings of the …, 2021 - openaccess.thecvf.com
Video instance segmentation (VIS) is the task that requires simultaneously classifying,
segmenting and tracking object instances of interest in video. Recent methods typically …

Instances as queries

Y Fang, S Yang, X Wang, Y Li, C Fang… - Proceedings of the …, 2021 - openaccess.thecvf.com
We present QueryInst, a new perspective for instance segmentation. QueryInst is a multi-
stage end-to-end system that treats instances of interest as learnable queries, enabling …

A generalist framework for panoptic segmentation of images and videos

T Chen, L Li, S Saxena, G Hinton… - Proceedings of the …, 2023 - openaccess.thecvf.com
Panoptic segmentation assigns semantic and instance ID labels to every pixel of an image.
As permutations of instance IDs are also valid solutions, the task requires learning of high …

In defense of online models for video instance segmentation

J Wu, Q Liu, Y Jiang, S Bai, A Yuille, X Bai - European Conference on …, 2022 - Springer
In recent years, video instance segmentation (VIS) has been largely advanced by offline
models, while online models gradually attracted less attention possibly due to their inferior …

[HTML][HTML] Coarse-to-fine video instance segmentation with factorized conditional appearance flows

Z Qin, X Lu, X Nie, D Liu, Y Yin, W Wang - IEEE/CAA Journal of …, 2023 - ieee-jas.net
We introduce a novel method using a new generative model that automatically learns
effective representations of the target and background appearance to detect, segment and …

Language as queries for referring video object segmentation

J Wu, Y Jiang, P Sun, Z Yuan… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Referring video object segmentation (R-VOS) is an emerging cross-modal task that aims to
segment the target object referred by a language expression in all video frames. In this work …

Sg-net: Spatial granularity network for one-stage video instance segmentation

D Liu, Y Cui, W Tan, Y Chen - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Video instance segmentation (VIS) is a new and critical task in computer vision. To date, top-
performing VIS methods extend the two-stage Mask R-CNN by adding a tracking branch …