[HTML][HTML] Review of large vision models and visual prompt engineering

J Wang, Z Liu, L Zhao, Z Wu, C Ma, S Yu, H Dai… - Meta-Radiology, 2023 - Elsevier
Visual prompt engineering is a fundamental methodology in the field of visual and image
artificial general intelligence. As the development of large vision models progresses, the …

A comprehensive survey on segment anything model for vision and beyond

C Zhang, L Liu, Y Cui, G Huang, W Lin, Y Yang… - arxiv preprint arxiv …, 2023 - arxiv.org
Artificial intelligence (AI) is evolving towards artificial general intelligence, which refers to the
ability of an AI system to perform a wide range of tasks and exhibit a level of intelligence …

Ma-sam: Modality-agnostic sam adaptation for 3d medical image segmentation

C Chen, J Miao, D Wu, A Zhong, Z Yan, S Kim… - Medical Image …, 2024 - Elsevier
Abstract The Segment Anything Model (SAM), a foundation model for general image
segmentation, has demonstrated impressive zero-shot performance across numerous …

Segment anything, from space?

S Ren, F Luzi, S Lahrichi, K Kassaw… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recently, the first foundation model developed specifically for image segmentation tasks
was developed, termed the" Segment Anything Model"(SAM). SAM can segment objects in …

Surgicalsam: Efficient class promptable surgical instrument segmentation

W Yue, J Zhang, K Hu, Y **a, J Luo… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
The Segment Anything Model (SAM) is a powerful foundation model that has revolutionised
image segmentation. To apply SAM to surgical instrument segmentation, a common …

Segment anything model for medical image segmentation: Current applications and future directions

Y Zhang, Z Shen, R Jiao - Computers in Biology and Medicine, 2024 - Elsevier
Due to the inherent flexibility of prompting, foundation models have emerged as the
predominant force in the fields of natural language processing and computer vision. The …

A survey on lora of large language models

Y Mao, Y Ge, Y Fan, W Xu, Y Mi, Z Hu… - Frontiers of Computer …, 2025 - Springer
Abstract Low-Rank Adaptation (LoRA), which updates the dense neural network layers with
pluggable low-rank matrices, is one of the best performed parameter efficient fine-tuning …

Annotation-free audio-visual segmentation

J Liu, Y Wang, C Ju, C Ma… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract The objective of Audio-Visual Segmentation (AVS) is to localise the sounding
objects within visual scenes by accurately predicting pixel-wise segmentation masks. To …

Deep interactive segmentation of medical images: A systematic review and taxonomy

Z Marinov, PF Jäger, J Egger, J Kleesiek… - IEEE transactions on …, 2024 - ieeexplore.ieee.org
Interactive segmentation is a crucial research area in medical image analysis aiming to
boost the efficiency of costly annotations by incorporating human feedback. This feedback …

Endodac: Efficient adapting foundation model for self-supervised depth estimation from any endoscopic camera

B Cui, M Islam, L Bai, A Wang, H Ren - International Conference on …, 2024 - Springer
Depth estimation plays a crucial role in various tasks within endoscopic surgery, including
navigation, surface reconstruction, and augmented reality visualization. Despite the …