Diffuse Attend and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion
Producing quality segmentation masks for images is a fundamental problem in computer
vision. Recent research has explored large-scale supervised training to enable zero-shot …
vision. Recent research has explored large-scale supervised training to enable zero-shot …
Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model
Recently, diffusion models have increasingly demonstrated their capabilities in vision
understanding. By leveraging prompt-based learning to construct sentences, these models …
understanding. By leveraging prompt-based learning to construct sentences, these models …
MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation
Semantic segmentation is essential in computer vision for various applications, yet
traditional approaches face significant challenges, including the high cost of annotation and …
traditional approaches face significant challenges, including the high cost of annotation and …
HaVTR: Improving Video-Text Retrieval Through Augmentation Using Large Foundation Models
While recent progress in video-text retrieval has been driven by the exploration of powerful
model architectures and training strategies, the representation learning ability of video-text …
model architectures and training strategies, the representation learning ability of video-text …