[HTML][HTML] Review of large vision models and visual prompt engineering
Visual prompt engineering is a fundamental methodology in the field of visual and image
artificial general intelligence. As the development of large vision models progresses, the …
artificial general intelligence. As the development of large vision models progresses, the …
A comprehensive survey on segment anything model for vision and beyond
Artificial intelligence (AI) is evolving towards artificial general intelligence, which refers to the
ability of an AI system to perform a wide range of tasks and exhibit a level of intelligence …
ability of an AI system to perform a wide range of tasks and exhibit a level of intelligence …
Ma-sam: Modality-agnostic sam adaptation for 3d medical image segmentation
Abstract The Segment Anything Model (SAM), a foundation model for general image
segmentation, has demonstrated impressive zero-shot performance across numerous …
segmentation, has demonstrated impressive zero-shot performance across numerous …
Segment anything, from space?
Recently, the first foundation model developed specifically for image segmentation tasks
was developed, termed the" Segment Anything Model"(SAM). SAM can segment objects in …
was developed, termed the" Segment Anything Model"(SAM). SAM can segment objects in …
Surgicalsam: Efficient class promptable surgical instrument segmentation
The Segment Anything Model (SAM) is a powerful foundation model that has revolutionised
image segmentation. To apply SAM to surgical instrument segmentation, a common …
image segmentation. To apply SAM to surgical instrument segmentation, a common …
Segment anything model for medical image segmentation: Current applications and future directions
Due to the inherent flexibility of prompting, foundation models have emerged as the
predominant force in the fields of natural language processing and computer vision. The …
predominant force in the fields of natural language processing and computer vision. The …
A survey on lora of large language models
Y Mao, Y Ge, Y Fan, W Xu, Y Mi, Z Hu… - Frontiers of Computer …, 2025 - Springer
Abstract Low-Rank Adaptation (LoRA), which updates the dense neural network layers with
pluggable low-rank matrices, is one of the best performed parameter efficient fine-tuning …
pluggable low-rank matrices, is one of the best performed parameter efficient fine-tuning …
Annotation-free audio-visual segmentation
Abstract The objective of Audio-Visual Segmentation (AVS) is to localise the sounding
objects within visual scenes by accurately predicting pixel-wise segmentation masks. To …
objects within visual scenes by accurately predicting pixel-wise segmentation masks. To …
Deep interactive segmentation of medical images: A systematic review and taxonomy
Interactive segmentation is a crucial research area in medical image analysis aiming to
boost the efficiency of costly annotations by incorporating human feedback. This feedback …
boost the efficiency of costly annotations by incorporating human feedback. This feedback …
Endodac: Efficient adapting foundation model for self-supervised depth estimation from any endoscopic camera
Depth estimation plays a crucial role in various tasks within endoscopic surgery, including
navigation, surface reconstruction, and augmented reality visualization. Despite the …
navigation, surface reconstruction, and augmented reality visualization. Despite the …