Enhancing representation in radiography-reports foundation model: A granular alignment algorithm using masked contrastive learning

W Huang, C Li, HY Zhou, H Yang, J Liu, Y Liang… - Nature …, 2024 - nature.com
Recently, multi-modal vision-language foundation models have gained significant attention
in the medical field. While these models offer great opportunities, they still face crucial …

Parameter-Efficient Fine-Tuning Medical Multimodal Large Language Models for Medical Visual Grounding

J He, P Li, G Liu, S Zhong - arxiv preprint arxiv:2410.23822, 2024 - arxiv.org
Multimodal Large Language Models (MLLMs) inherit the superior text understanding
capabilities of LLMs and extend these capabilities to multimodal scenarios. These models …

Multimodal Modeling of Radiologist Reasoning on Chest X-rays

D Blasko - 2025 - repositum.tuwien.at
Chest X-rays are a foundational tool for medical diagnostics, and yet interpreting them takes
radiologists' time and is subject to challenges, prompting the development of reliable …