Advances in medical image analysis with vision transformers: a comprehensive review

R Azad, A Kazerouni, M Heidari, EK Aghdam… - Medical Image …, 2024 - Elsevier
The remarkable performance of the Transformer architecture in natural language processing
has recently also triggered broad interest in Computer Vision. Among other merits …

Transformers in medical imaging: A survey

F Shamshad, S Khan, SW Zamir, MH Khan… - Medical Image …, 2023 - Elsevier
Following unprecedented success on the natural language tasks, Transformers have been
successfully applied to several computer vision problems, achieving state-of-the-art results …

Dynamic graph enhanced contrastive learning for chest x-ray report generation

M Li, B Lin, Z Chen, H Lin, X Liang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Automatic radiology reporting has great clinical potential to relieve radiologists from heavy
workloads and improve diagnosis interpretation. Recently, researchers have enhanced data …

Interactive and explainable region-guided radiology report generation

T Tanida, P Müller, G Kaissis… - Proceedings of the …, 2023 - openaccess.thecvf.com
The automatic generation of radiology reports has the potential to assist radiologists in the
time-consuming task of report writing. Existing methods generate the full report from image …

Metransformer: Radiology report generation by transformer with multiple learnable expert tokens

Z Wang, L Liu, L Wang, L Zhou - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
In clinical scenarios, multi-specialist consultation could significantly benefit the diagnosis,
especially for intricate cases. This inspires us to explore a" multi-expert joint diagnosis" …

Kiut: Knowledge-injected u-transformer for radiology report generation

Z Huang, X Zhang, S Zhang - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Radiology report generation aims to automatically generate a clinically accurate and
coherent paragraph from the X-ray image, which could relieve radiologists from the heavy …

Prior: Prototype representation joint learning from medical images and reports

P Cheng, L Lin, J Lyu, Y Huang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Contrastive learning based vision-language joint pre-training has emerged as a successful
representation learning strategy. In this paper, we present a prototype representation …

Cross-modal prototype driven network for radiology report generation

J Wang, A Bhalerao, Y He - European Conference on Computer Vision, 2022 - Springer
Radiology report generation (RRG) aims to describe automatically a radiology image with
human-like language and could potentially support the work of radiologists, reducing the …

Reinforced cross-modal alignment for radiology report generation

H Qin, Y Song - Findings of the Association for Computational …, 2022 - aclanthology.org
Medical images are widely used in clinical decision-making, where writing radiology reports
is a potential application that can be enhanced by automatic solutions to alleviate …

Clinical-bert: Vision-language pre-training for radiograph diagnosis and reports generation

B Yan, M Pei - Proceedings of the AAAI Conference on Artificial …, 2022 - ojs.aaai.org
In this paper, we propose a vision-language pre-training model, Clinical-BERT, for the
medical domain, and devise three domain-specific tasks: Clinical Diagnosis (CD), Masked …