A comprehensive survey of foundation models in medicine

W Khan, S Leem, KB See, JK Wong… - IEEE Reviews in …, 2025 - ieeexplore.ieee.org
Foundation models (FMs) are large-scale deeplearning models that are developed using
large datasets and self-supervised learning methods. These models serve as a base for …

Rocov2: Radiology objects in context version 2, an updated multimodal image dataset

J Rückert, L Bloch, R Brüngel, A Idrissi-Yaghir… - Scientific Data, 2024 - nature.com
Automated medical image analysis systems often require large amounts of training data with
high quality labels, which are difficult and time consuming to generate. This paper …

From query tools to causal architects: Harnessing large language models for advanced causal discovery from data

T Ban, L Chen, X Wang, H Chen - arxiv preprint arxiv:2306.16902, 2023 - arxiv.org
Large Language Models (LLMs) exhibit exceptional abilities for causal analysis between
concepts in numerous societally impactful domains, including medicine, science, and law …

A comprehensive survey of large language models and multimodal large language models in medicine

H **ao, F Zhou, X Liu, T Liu, Z Li, X Liu… - arxiv preprint arxiv …, 2024 - arxiv.org
Since the release of ChatGPT and GPT-4, large language models (LLMs) and multimodal
large language models (MLLMs) have garnered significant attention due to their powerful …

Overview of ImageCLEFmedical 2023–caption prediction and concept detection

J Rückert, A Ben Abacha… - Working Notes of the …, 2023 - arodes.hes-so.ch
Résumé The 2023 ImageCLEFmedical GANs task is the first edition of this task, examining
the existing hypothesis that GANs (Generative Adversarial Networks) are generating …

Depression detection in clinical interviews with LLM-empowered structural element graph

Z Chen, J Deng, J Zhou, J Wu, T Qian… - Proceedings of the …, 2024 - aclanthology.org
Depression is a widespread mental health disorder affecting millions globally. Clinical
interviews are the gold standard for assessing depression, but they heavily rely on scarce …

Large Language Model for Medical Images: A Survey of Taxonomy, Systematic Review, and Future Trends

P Wang, W Lu, C Lu, R Zhou, M Li… - Big Data Mining and …, 2025 - ieeexplore.ieee.org
The advent of Large Language Models (LLMs) has sparked considerable interest in the
medical image domain, as they can generalize to multiple tasks and offer outstanding …

Maken: Improving medical report generation with adapter tuning and knowledge enhancement in vision-language foundation models

S Wu, B Yang, Z Ye, H Wang, H Zheng… - … on Biomedical Imaging …, 2024 - ieeexplore.ieee.org
Medical report generation demands automatic creation of coherent and precise descriptions
for medical images. However, the scarcity of labelled medical image-report pairs poses …

Improving Medical Report Generation with Adapter Tuning and Knowledge Enhancement in Vision-Language Foundation Models

S Wu, B Yang, Z Ye, H Wang, H Zheng… - arxiv preprint arxiv …, 2023 - arxiv.org
Medical report generation demands automatic creation of coherent and precise descriptions
for medical images. However, the scarcity of labelled medical image-report pairs poses …

ReXErr: Synthesizing Clinically Meaningful Errors in Diagnostic Radiology Reports

VM Rao, S Zhang, JN Acosta, S Adithan… - … 2025: Proceedings of …, 2024 - World Scientific
Accurately interpreting medical images and writing radiology reports is a critical but
challenging task in healthcare. Both human-written and AI-generated reports can contain …