- Academic Search

Articles

Scholar

1 result (0.02 sec)

My profile My library

HyperSeg: Towards Universal Visual Segmentation with Large Language Model

Search within citing articles

[Free GPT-4]

[PDF] arxiv.org

InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models

C Wei, Y Zhong, H Tan, Y Zeng, Y Liu, Z Zhao… - arxiv preprint arxiv …, 2024 - arxiv.org

Boosted by Multi-modal Large Language Models (MLLMs), text-guided universal
segmentation models for the image and video domains have made rapid progress recently …

Save Cite Related articles All 2 versions Free GPT-4 View as HTML

Create alert

Cite

Advanced search

Saved to My library

HyperSeg: Towards Universal Visual Segmentation with Large Language Model

InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models