[HTML][HTML] Review of large vision models and visual prompt engineering

J Wang, Z Liu, L Zhao, Z Wu, C Ma, S Yu, H Dai… - Meta-Radiology, 2023 - Elsevier
Visual prompt engineering is a fundamental methodology in the field of visual and image
artificial general intelligence. As the development of large vision models progresses, the …

A comprehensive survey on segment anything model for vision and beyond

C Zhang, L Liu, Y Cui, G Huang, W Lin, Y Yang… - arxiv preprint arxiv …, 2023 - arxiv.org
Artificial intelligence (AI) is evolving towards artificial general intelligence, which refers to the
ability of an AI system to perform a wide range of tasks and exhibit a level of intelligence …

Efficientsam: Leveraged masked image pretraining for efficient segment anything

Y **ong, B Varadarajan, L Wu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract Segment Anything Model (SAM) has emerged as a powerful tool for numerous
vision applications. A key component that drives the impressive performance for zero-shot …

A survey on segment anything model (sam): Vision foundation model meets prompt engineering

C Zhang, FD Puspitasari, S Zheng, C Li, Y Qiao… - arxiv preprint arxiv …, 2023 - arxiv.org
Segment anything model (SAM) developed by Meta AI Research has recently attracted
significant attention. Trained on a large segmentation dataset of over 1 billion masks, SAM is …

Explainable generative ai (genxai): A survey, conceptualization, and research agenda

J Schneider - Artificial Intelligence Review, 2024 - Springer
Generative AI (GenAI) represents a shift from AI's ability to “recognize” to its ability to
“generate” solutions for a wide range of tasks. As generated solutions and applications grow …

Vale: A multimodal visual and language explanation framework for image classifiers using explainable ai and language models

P Natarajan, A Nambiar - arxiv preprint arxiv:2408.12808, 2024 - arxiv.org
Deep Neural Networks (DNNs) have revolutionized various fields by enabling task
automation and reducing human error. However, their internal workings and decision …

Explainable concept generation through vision-language preference learning

A Taparia, S Sagar, R Senanayake - arxiv preprint arxiv:2408.13438, 2024 - arxiv.org
Concept-based explanations have become a popular choice for explaining deep neural
networks post-hoc because, unlike most other explainable AI techniques, they can be used …

Evaluating readability and faithfulness of concept-based explanations

M Li, H **, R Huang, Z Xu, D Lian, Z Lin… - arxiv preprint arxiv …, 2024 - arxiv.org
With the growing popularity of general-purpose Large Language Models (LLMs), comes a
need for more global explanations of model behaviors. Concept-based explanations arise …

Eliminating information leakage in hard concept bottleneck models with supervised, hierarchical concept learning

A Sun, Y Yuan, P Ma, S Wang - arxiv preprint arxiv:2402.05945, 2024 - arxiv.org
Concept Bottleneck Models (CBMs) aim to deliver interpretable and interventionable
predictions by bridging features and labels with human-understandable concepts. While …

Trans-SAM: Transfer Segment Anything Model to medical image segmentation with Parameter-Efficient Fine-Tuning

Y Wu, Z Wang, X Yang, H Kang, A He, T Li - Knowledge-Based Systems, 2025 - Elsevier
Abstract Recently, the Segment Anything Model (SAM) has gained substantial attention in
image segmentation due to its remarkable performance. It has demonstrated impressive …