- Academic Search

J Tian, L Aggarwal, A Colaco, Z Kira… - Proceedings of the …, 2024 - openaccess.thecvf.com

Producing quality segmentation masks for images is a fundamental problem in computer
vision. Recent research has explored large-scale supervised training to enable zero-shot …

保存引用被引用数: 61 関連記事全 3 バージョン HTMLバージョン

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model

D Yang, R Dong, J Ji, Y Ma, H Wang, X Sun… - European Conference on …, 2024 - Springer

Recently, diffusion models have increasingly demonstrated their capabilities in vision
understanding. By leveraging prompt-based learning to construct sentences, these models …

保存引用関連記事全 7 バージョン

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation

Y Kawano, Y Aoki - arxiv preprint arxiv:2403.11194, 2024 - arxiv.org

Semantic segmentation is essential in computer vision for various applications, yet
traditional approaches face significant challenges, including the high cost of annotation and …

保存引用被引用数: 3 関連記事全 2 バージョン HTMLバージョン

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

HaVTR: Improving Video-Text Retrieval Through Augmentation Using Large Foundation Models

Y Wang, S Yuan, X Jian, W Pang, M Wang… - arxiv preprint arxiv …, 2024 - arxiv.org

While recent progress in video-text retrieval has been driven by the exploration of powerful
model architectures and training strategies, the representation learning ability of video-text …

保存引用被引用数: 2 関連記事全 2 バージョン HTMLバージョン

アラートを作成

引用

検索オプション

マイライブラリに保存しました

Network-free, unsupervised semantic segmentation with synthetic images

Diffuse Attend and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion

Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model

MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation

HaVTR: Improving Video-Text Retrieval Through Augmentation Using Large Foundation Models