LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation

H Ding, L Hong, C Liu, N Xu, L Yang, Y Fan… - arxiv preprint arxiv …, 2024 - arxiv.org
Despite the promising performance of current video segmentation models on existing
benchmarks, these models still struggle with complex scenes. In this paper, we introduce the …

The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation

T Tran - arxiv preprint arxiv:2408.12447, 2024 - arxiv.org
Referring Video Object Segmentation (RVOS) is a challenging task due to its requirement for
temporal understanding. Due to the obstacle of computational complexity, many state-of-the …

UNINEXT-Cutie: The 1st Solution for LSVOS Challenge RVOS Track

H Fang, F Pan, X Lu, W Zhang, R Cong - arxiv preprint arxiv:2408.10129, 2024 - arxiv.org
Referring video object segmentation (RVOS) relies on natural language expressions to
segment target objects in video. In this year, LSVOS Challenge RVOS Track replaced the …