Rocov2: Radiology objects in context version 2, an updated multimodal image dataset

J Rückert, L Bloch, R Brüngel, A Idrissi-Yaghir… - Scientific Data, 2024 - nature.com
Automated medical image analysis systems often require large amounts of training data with
high quality labels, which are difficult and time consuming to generate. This paper …

Advancing medical imaging with language models: featuring a spotlight on ChatGPT

M Hu, J Qian, S Pan, Y Li, RLJ Qiu… - Physics in Medicine & …, 2024 - iopscience.iop.org
This review paper aims to serve as a comprehensive guide and instructional resource for
researchers seeking to effectively implement language models in medical imaging research …

Overview of ImageCLEFmedical 2023–caption prediction and concept detection

J Rückert, A Ben Abacha… - Working Notes of the …, 2023 - arodes.hes-so.ch
Résumé The 2023 ImageCLEFmedical GANs task is the first edition of this task, examining
the existing hypothesis that GANs (Generative Adversarial Networks) are generating …

Large Language Model for Medical Images: A Survey of Taxonomy, Systematic Review, and Future Trends

P Wang, W Lu, C Lu, R Zhou, M Li… - Big Data Mining and …, 2025 - ieeexplore.ieee.org
The advent of Large Language Models (LLMs) has sparked considerable interest in the
medical image domain, as they can generalize to multiple tasks and offer outstanding …

Maken: Improving medical report generation with adapter tuning and knowledge enhancement in vision-language foundation models

S Wu, B Yang, Z Ye, H Wang, H Zheng… - … on Biomedical Imaging …, 2024 - ieeexplore.ieee.org
Medical report generation demands automatic creation of coherent and precise descriptions
for medical images. However, the scarcity of labelled medical image-report pairs poses …

UIT-DarkCow team at ImageCLEFmedical Caption 2024: Diagnostic Captioning for Radiology Images Efficiency with Transformer Models

Q Van Nguyen, HQ Pham, DQ Tran… - arxiv preprint arxiv …, 2024 - arxiv.org
Purpose: This study focuses on the development of automated text generation from
radiology images, termed diagnostic captioning, to assist medical professionals in reducing …

Improving Medical Report Generation with Adapter Tuning and Knowledge Enhancement in Vision-Language Foundation Models

S Wu, B Yang, Z Ye, H Wang, H Zheng… - arxiv preprint arxiv …, 2023 - arxiv.org
Medical report generation demands automatic creation of coherent and precise descriptions
for medical images. However, the scarcity of labelled medical image-report pairs poses …

VTIENet: visual-text information enhancement network for image captioning

J Yang, Y Wei, R Wang, L Xue - Multimedia Systems, 2025 - Springer
Current image captioning methods based on grid features and segmentation features have
achieved some success. However, existing methods combine the two visual features by …

Adopting Generative AI with Precaution in Dentistry: A Review and Reflection

M Xu, C Ye, Z Zeng, C Chang, S Qi… - … on Digital Health …, 2024 - ieeexplore.ieee.org
The progress in large language models (LLMs) brings much excitement and efforts in
medical artificial intelligence, which could transform patient-doctor conversation while …

C2RG: Parameter-efficient Adaptation of 3D Vision and Language Foundation Model for Coronary CTA Report Generation

Z Ye, Y Sun, W Shi, B Yang, S Wu… - 2024 IEEE …, 2024 - ieeexplore.ieee.org
Medical report generation (MRG) is a challenging yet highly demanding task in the
application of multi-modal artificial intelligence in medicine. Typically, training an MRG …