Foundation Models Defining a New Era in Vision: a Survey and Outlook
Vision systems that see and reason about the compositional nature of visual scenes are
fundamental to understanding our world. The complex relations between objects and their …
fundamental to understanding our world. The complex relations between objects and their …
[HTML][HTML] Review of large vision models and visual prompt engineering
Visual prompt engineering is a fundamental methodology in the field of visual and image
artificial general intelligence. As the development of large vision models progresses, the …
artificial general intelligence. As the development of large vision models progresses, the …
Sam 2: Segment anything in images and videos
We present Segment Anything Model 2 (SAM 2), a foundation model towards solving
promptable visual segmentation in images and videos. We build a data engine, which …
promptable visual segmentation in images and videos. We build a data engine, which …
Segment anything model for medical images?
Abstract The Segment Anything Model (SAM) is the first foundation model for general image
segmentation. It has achieved impressive results on various natural image segmentation …
segmentation. It has achieved impressive results on various natural image segmentation …
Sam-clip: Merging vision foundation models towards semantic and spatial understanding
The landscape of publicly available vision foundation models (VFMs) such as CLIP and
SAM is expanding rapidly. VFMs are endowed with distinct capabilities stemming from their …
SAM is expanding rapidly. VFMs are endowed with distinct capabilities stemming from their …
Segment anything is not always perfect: An investigation of sam on different real-world applications
Abstract Recently, Meta AI Research approaches a general, promptable segment anything
model (SAM) pre-trained on an unprecedentedly large segmentation dataset (SA-1B) …
model (SAM) pre-trained on an unprecedentedly large segmentation dataset (SA-1B) …
Ma-sam: Modality-agnostic sam adaptation for 3d medical image segmentation
Abstract The Segment Anything Model (SAM), a foundation model for general image
segmentation, has demonstrated impressive zero-shot performance across numerous …
segmentation, has demonstrated impressive zero-shot performance across numerous …
On the challenges and perspectives of foundation models for medical image analysis
This article discusses the opportunities, applications and future directions of large-scale
pretrained models, ie, foundation models, which promise to significantly improve the …
pretrained models, ie, foundation models, which promise to significantly improve the …
Medsegdiff-v2: Diffusion-based medical image segmentation with transformer
The Diffusion Probabilistic Model (DPM) has recently gained popularity in the field of
computer vision, thanks to its image generation applications, such as Imagen, Latent …
computer vision, thanks to its image generation applications, such as Imagen, Latent …
Large ai models in health informatics: Applications, challenges, and the future
Large AI models, or foundation models, are models recently emerging with massive scales
both parameter-wise and data-wise, the magnitudes of which can reach beyond billions …
both parameter-wise and data-wise, the magnitudes of which can reach beyond billions …