Large language models for robotics: Opportunities, challenges, and perspectives

J Wang, E Shi, H Hu, C Ma, Y Liu, X Wang… - Journal of Automation …, 2024 - Elsevier
Large language models (LLMs) have undergone significant expansion and have been
increasingly integrated across various domains. Notably, in the realm of robot task planning …

Challenges and barriers of using large language models (LLM) such as ChatGPT for diagnostic medicine with a focus on digital pathology–a recent scoping review

E Ullah, A Parwani, MM Baig, R Singh - Diagnostic pathology, 2024 - Springer
Background The integration of large language models (LLMs) like ChatGPT in diagnostic
medicine, with a focus on digital pathology, has garnered significant attention. However …

Vision-language models for medical report generation and visual question answering: A review

I Hartsock, G Rasool - Frontiers in Artificial Intelligence, 2024 - frontiersin.org
Medical vision-language models (VLMs) combine computer vision (CV) and natural
language processing (NLP) to analyze visual and textual medical data. Our paper reviews …

CLIP in medical imaging: A comprehensive survey

Z Zhao, Y Liu, H Wu, M Wang, Y Li, S Wang… - ar…
… field with several prospective
clinical studies demonstrating its benefits in clinical practice. In 2022, the Korean Society of …

AMAM: an attention-based multimodal alignment model for medical visual question answering

H Pan, S He, K Zhang, B Qu, C Chen, K Shi - Knowledge-Based Systems, 2022 - Elsevier
Abstract Medical Visual Question Answering (VQA) is a multimodal task to answer clinical
questions about medical images. Existing methods have achieved good performance, but …