- Academic Search

Z Lin, D Zhang, Q Tao, D Shi, G Haffari, Q Wu… - Artificial Intelligence in …, 2023 - Elsevier

Abstract Medical Visual Question Answering (VQA) is a combination of medical artificial
intelligence and popular VQA challenges. Given a medical image and a clinically relevant …

Save Cite Cited by 112 Related articles All 8 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Pmc-vqa: Visual instruction tuning for medical visual question answering

X Zhang, C Wu, Z Zhao, W Lin, Y Zhang… - ar** review on multimodal deep learning in biomedical images and texts

Z Sun, M Lin, Q Zhu, Q **e, F Wang, Z Lu… - Journal of Biomedical …, 2023 - Elsevier

Objective Computer-assisted diagnostic and prognostic systems of the future should be
capable of simultaneously processing multimodal data. Multimodal deep learning (MDL) …

Save Cite Cited by 15 Related articles All 9 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Pmc-clip: Contrastive language-image pre-training using biomedical documents

W Lin, Z Zhao, X Zhang, C Wu, Y Zhang… - … Conference on Medical …, 2023 - Springer

Foundation models trained on large-scale dataset gain a recent surge in CV and NLP. In
contrast, development in biomedical domain lags far behind due to data scarcity. To address …

Save Cite Cited by 134 Related articles All 6 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Multi-modal masked autoencoders for medical vision-and-language pre-training

Z Chen, Y Du, J Hu, Y Liu, G Li, X Wan… - … Conference on Medical …, 2022 - Springer

Medical vision-and-language pre-training provides a feasible solution to extract effective
vision-and-language representations from medical images and texts. However, few studies …

Save Cite Cited by 125 Related articles All 4 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Align, reason and learn: Enhancing medical vision-and-language pre-training with knowledge

Z Chen, G Li, X Wan - Proceedings of the 30th ACM International …, 2022 - dl.acm.org

Medical vision-and-language pre-training (Med-VLP) has received considerable attention
owing to its applicability to extracting generic vision-and-language representations from …

Save Cite Cited by 61 Related articles All 4 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Does clip benefit visual question answering in the medical domain as much as it does in the general domain?

S Eslami, G de Melo, C Meinel - arxiv preprint arxiv:2112.13906, 2021 - arxiv.org

Contrastive Language--Image Pre-training (CLIP) has shown remarkable success in
learning with cross-modal supervision from extensive amounts of image--text pairs collected …

Save Cite Cited by 103 Related articles All 3 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] aclanthology.org

Pubmedclip: How much does clip benefit visual question answering in the medical domain?

S Eslami, C Meinel, G De Melo - Findings of the Association for …, 2023 - aclanthology.org

Abstract Contrastive Language–Image Pre-training (CLIP) has shown remarkable success
in learning with cross-modal supervision from extensive amounts of image–text pairs …

Save Cite Cited by 98 Related articles View as HTML

[Free GPT-4]

[PDF] arxiv.org

Open-ended medical visual question answering through prefix tuning of language models

T Van Sonsbeek, MM Derakhshani… - … Conference on Medical …, 2023 - Springer

Abstract Medical Visual Question Answering (VQA) is an important challenge, as it would
lead to faster and more accurate diagnoses and treatment decisions. Most existing methods …

Save Cite Cited by 57 Related articles All 7 versions Free GPT-4

[Free GPT-4]

[PDF] thecvf.com

Towards unifying medical vision-and-language pre-training via soft prompts

Z Chen, S Diao, B Wang, G Li… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Medical vision-and-language pre-training (Med-VLP) has shown promising improvements
on many downstream medical tasks owing to its applicability to extracting generic …

Save Cite Cited by 32 Related articles All 5 versions Free GPT-4 View as HTML

Create alert

Cite

Advanced search

Saved to My library

Contrastive pre-training and representation distillation for medical visual question answering...

Medical visual question answering: A survey

Pmc-vqa: Visual instruction tuning for medical visual question answering

Pmc-clip: Contrastive language-image pre-training using biomedical documents

Multi-modal masked autoencoders for medical vision-and-language pre-training

Align, reason and learn: Enhancing medical vision-and-language pre-training with knowledge

Does clip benefit visual question answering in the medical domain as much as it does in the general domain?

Pubmedclip: How much does clip benefit visual question answering in the medical domain?

Open-ended medical visual question answering through prefix tuning of language models

Towards unifying medical vision-and-language pre-training via soft prompts