EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery

G Wang, L Bai, J Wang, K Yuan, Z Li, T Jiang… - arxiv preprint arxiv …, 2025 - arxiv.org
Recently, Multimodal Large Language Models (MLLMs) have demonstrated their immense
potential in computer-aided diagnosis and decision-making. In the context of robotic …

Efficient Few-Shot Medical Image Analysis via Hierarchical Contrastive Vision-Language Learning

H Fuller, FG Garcia, V Flores - arxiv preprint arxiv:2501.09294, 2025 - arxiv.org
Few-shot learning in medical image classification presents a significant challenge due to the
limited availability of annotated data and the complex nature of medical imagery. In this …