- Academic Search

S Lu, M Liu, L Yin, Z Yin, X Liu, W Zheng - PeerJ Computer Science, 2023 - peerj.com

Abstract Visual Question Answering (VQA) is a significant cross-disciplinary issue in the
fields of computer vision and natural language processing that requires a computer to output …

Save Cite Cited by 219 Related articles All 8 versions Free GPT-4 DeepSeek Cached

[Free GPT-4]
[DeepSeek]

[PDF] frontiersin.org

Vision-language models for medical report generation and visual question answering: A review

I Hartsock, G Rasool - Frontiers in Artificial Intelligence, 2024 - frontiersin.org

Medical vision-language models (VLMs) combine computer vision (CV) and natural
language processing (NLP) to analyze visual and textual medical data. Our paper reviews …

Save Cite Cited by 42 Related articles All 2 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] researchgate.net

[PDF][PDF] Large-scale domain-specific pretraining for biomedical vision-language processing

S Zhang, Y Xu, N Usuyama, J Bagga… - arxiv preprint arxiv …, 2023 - researchgate.net

Contrastive pretraining on parallel image-text data has attained great success in vision-
language processing (VLP), as exemplified by CLIP and related methods. However, prior …

Save Cite Cited by 163 Related articles View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Slake: A semantically-labeled knowledge-enhanced dataset for medical visual question answering

B Liu, LM Zhan, L Xu, L Ma, Y Yang… - 2021 IEEE 18th …, 2021 - ieeexplore.ieee.org

Medical visual question answering (Med-VQA) has tremendous potential in healthcare.
However, the development of this technology is hindered by the lacking of publicly-available …

Save Cite Cited by 257 Related articles All 7 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Endora: Video Generation Models as Endoscopy Simulators

C Li, H Liu, Y Liu, BY Feng, W Li, X Liu, Z Chen… - … Conference on Medical …, 2024 - Springer

Generative models hold promise for revolutionizing medical education, robot-assisted
surgery, and data augmentation for machine learning. Despite progress in generating 2D …

Save Cite Cited by 35 Related articles All 2 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Natural language processing for smart healthcare

B Zhou, G Yang, Z Shi, S Ma - IEEE Reviews in Biomedical …, 2022 - ieeexplore.ieee.org

Smart healthcare has achieved significant progress in recent years. Emerging artificial
intelligence (AI) technologies enable various smart applications across various healthcare …

Save Cite Cited by 150 Related articles All 9 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Biomedical question answering: a survey of approaches and challenges

Q **, Z Yuan, G **ong, Q Yu, H Ying, C Tan… - ACM Computing …, 2022 - dl.acm.org

Automatic Question Answering (QA) has been successfully applied in various domains such
as search engines and chatbots. Biomedical QA (BQA), as an emerging QA task, enables …

Save Cite Cited by 130 Related articles All 4 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Mmbert: Multimodal bert pretraining for improved medical vqa

Y Khare, V Bagal, M Mathew, A Devi… - 2021 IEEE 18th …, 2021 - ieeexplore.ieee.org

Images in the medical domain are fundamentally different from the general domain images.
Consequently, it is infeasible to directly employ general domain Visual Question Answering …

Save Cite Cited by 141 Related articles All 7 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Multiple meta-model quantifying for medical visual question answering

T Do, BX Nguyen, E Tjiputra, M Tran, QD Tran… - … Image Computing and …, 2021 - Springer

Transfer learning is an important step to extract meaningful features and overcome the data
limitation in the medical Visual Question Answering (VQA) task. However, most of the …

Save Cite Cited by 121 Related articles All 7 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Align, reason and learn: Enhancing medical vision-and-language pre-training with knowledge

Z Chen, G Li, X Wan - Proceedings of the 30th ACM International …, 2022 - dl.acm.org

Medical vision-and-language pre-training (Med-VLP) has received considerable attention
owing to its applicability to extracting generic vision-and-language representations from …

Save Cite Cited by 63 Related articles All 4 versions Free GPT-4 DeepSeek

Create alert

Cite

Advanced search

Saved to My library

Medical visual question answering via conditional reasoning

The multi-modal fusion in visual question answering: a review of attention mechanisms

Vision-language models for medical report generation and visual question answering: A review

[PDF][PDF] Large-scale domain-specific pretraining for biomedical vision-language processing

Slake: A semantically-labeled knowledge-enhanced dataset for medical visual question answering

Endora: Video Generation Models as Endoscopy Simulators

Natural language processing for smart healthcare

Biomedical question answering: a survey of approaches and challenges

Mmbert: Multimodal bert pretraining for improved medical vqa

Multiple meta-model quantifying for medical visual question answering

Align, reason and learn: Enhancing medical vision-and-language pre-training with knowledge