- Academic Search

X Li, H Ding, H Yuan, W Zhang, J Pang… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …

Speichern Zitieren Zitiert von: 125 Ähnliche Artikel Alle 3 Versionen

[Free GPT-4]
[DeepSeek]

[PDF] springer.com

Effectiveness assessment of recent large vision-language models

Y Jiang, X Yan, GP Ji, K Fu, M Sun, H **ong, DP Fan… - Visual Intelligence, 2024 - Springer

The advent of large vision-language models (LVLMs) represents a remarkable advance in
the quest for artificial general intelligence. However, the models' effectiveness in both …

Speichern Zitieren Zitiert von: 22 Ähnliche Artikel Alle 2 Versionen

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Segpoint: Segment any point cloud via large language model

S He, H Ding, X Jiang, B Wen - European Conference on Computer Vision, 2024 - Springer

Despite significant progress in 3D point cloud segmentation, existing methods primarily
address specific tasks and depend on explicit instructions to identify targets, lacking the …

Speichern Zitieren Zitiert von: 12 Ähnliche Artikel Alle 4 Versionen

[Free GPT-4]
[DeepSeek]

[PDF] springer.com

Primitivenet: decomposing the global constraints for referring segmentation

C Liu, X Jiang, H Ding - Visual Intelligence, 2024 - Springer

In referring segmentation, modeling the complicated constraints in the multimodal
information is one of the most challenging problems. As the information in a given language …

Speichern Zitieren Zitiert von: 10 Ähnliche Artikel Alle 2 Versionen

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

RefMask3D: Language-guided transformer for 3D referring segmentation

S He, H Ding - Proceedings of the 32nd ACM International …, 2024 - dl.acm.org

3D referring segmentation is an emerging and challenging vision-language task that aims to
segment the object described by a natural language expression in a point cloud scene. The …

Speichern Zitieren Zitiert von: 2 Ähnliche Artikel Alle 4 Versionen

[Free GPT-4]
[DeepSeek]

[PDF] uwa.edu.au

Temporally consistent referring video object segmentation with hybrid memory

B Miao, M Bennamoun, Y Gao… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Referring Video Object Segmentation (R-VOS) methods face challenges in maintaining
consistent object segmentation due to temporal context variability and the presence of other …

Speichern Zitieren Zitiert von: 2 Ähnliche Artikel Alle 4 Versionen

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Pvuw 2024 challenge on complex video understanding: Methods and results

H Ding, C Liu, Y Wei, N Ravi, S He, S Bai, P Torr… - arxiv preprint arxiv …, 2024 - arxiv.org

Pixel-level Video Understanding in the Wild Challenge (PVUW) focus on complex video
understanding. In this CVPR 2024 workshop, we add two new tracks, Complex Video Object …

Speichern Zitieren Zitiert von: 3 Ähnliche Artikel Alle 4 Versionen HTML-Version

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

One token to seg them all: Language instructed reasoning segmentation in videos

Z Bai, T He, H Mei, P Wang, Z Gao, J Chen… - arxiv preprint arxiv …, 2024 - arxiv.org

We introduce VideoLISA, a video-based multimodal large language model designed to
tackle the problem of language-instructed reasoning segmentation in videos. Leveraging the …

Speichern Zitieren Zitiert von: 8 Ähnliche Artikel Alle 5 Versionen HTML-Version

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Motion-grounded video reasoning: Understanding and perceiving motion at pixel level

A Deng, T Chen, S Yu, T Yang, L Spencer… - arxiv preprint arxiv …, 2024 - arxiv.org

In this paper, we introduce Motion-Grounded Video Reasoning, a new motion
understanding task that requires generating visual answers (video segmentation masks) …

Speichern Zitieren Zitiert von: 1 Ähnliche Artikel Alle 3 Versionen HTML-Version

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation

H Ding, L Hong, C Liu, N Xu, L Yang, Y Fan… - arxiv preprint arxiv …, 2024 - arxiv.org

Despite the promising performance of current video segmentation models on existing
benchmarks, these models still struggle with complex scenes. In this paper, we introduce the …

Speichern Zitieren Ähnliche Artikel Alle 4 Versionen HTML-Version

Alert erstellen

Zitieren

Erweiterte Suche

In „Meine Bibliothek“ gespeichert

Decoupling static and hierarchical motion perception for referring video segmentation

Transformer-based visual segmentation: A survey

Effectiveness assessment of recent large vision-language models

Segpoint: Segment any point cloud via large language model

Primitivenet: decomposing the global constraints for referring segmentation

RefMask3D: Language-guided transformer for 3D referring segmentation

Temporally consistent referring video object segmentation with hybrid memory

Pvuw 2024 challenge on complex video understanding: Methods and results

One token to seg them all: Language instructed reasoning segmentation in videos

Motion-grounded video reasoning: Understanding and perceiving motion at pixel level

LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation