Voxelprompt: A vision-language agent for grounded medical image analysis

A Hoopes, VI Butoi, JV Guttag, AV Dalca - arxiv preprint arxiv:2410.08397, 2024 - arxiv.org
We present VoxelPrompt, an agent-driven vision-language framework that tackles diverse
radiological tasks through joint modeling of natural language, image volumes, and analytical …

Interpretable bilingual multimodal large language model for diverse biomedical tasks

L Wang, H Wang, H Yang, J Mao, Z Yang… - arxiv preprint arxiv …, 2024 - arxiv.org
Several medical Multimodal Large Languange Models (MLLMs) have been developed to
address tasks involving visual images with textual instructions across various medical …

Can Modern LLMs Act as Agent Cores in Radiology~ Environments?

Q Zheng, C Wu, P Qiu, L Dai, Y Zhang, Y Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
Advancements in large language models (LLMs) have paved the way for LLM-based agent
systems that offer enhanced accuracy and interpretability across various domains …