A survey of large language models in medicine: Progress, application, and challenge

H Zhou, F Liu, B Gu, X Zou, J Huang, J Wu, Y Li… - arxiv preprint arxiv …, 2023 - arxiv.org
Large language models (LLMs), such as ChatGPT, have received substantial attention due
to their capabilities for understanding and generating human language. While there has …

Testing and evaluation of health care applications of large language models: a systematic review

S Bedi, Y Liu, L Orr-Ewing, D Dash, S Koyejo… - JAMA, 2024 - jamanetwork.com
Importance Large language models (LLMs) can assist in various health care activities, but
current evaluation approaches may not adequately identify the most useful application …

Towards evaluating and building versatile large language models for medicine

C Wu, P Qiu, J Liu, H Gu, N Li, Y Zhang, Y Wang… - npj Digital …, 2025 - nature.com
In this study, we present MedS-Bench, a comprehensive benchmark to evaluate large
language models (LLMs) in clinical contexts, MedS-Bench, spanning 11 high-level clinical …

Zero shot health trajectory prediction using transformer

P Renc, Y Jia, AE Samir, J Was, Q Li, DW Bates… - NPJ Digital …, 2024 - nature.com
Integrating modern machine learning and clinical decision-making has great promise for
mitigating healthcare's increasing cost and complexity. We introduce the Enhanced …

Artificial intelligence in oncology: ensuring safe and effective integration of language models in clinical practice

L Verlingue, C Boyer, L Olgiati, CB Mairesse… - The Lancet Regional …, 2024 - thelancet.com
Summary In this Personal View, we address the latest advancements in automatic text
analysis with artificial intelligence (AI) in medicine, with a focus on its implications in aiding …

[HTML][HTML] Natural Language Processing in Medicine and Ophthalmology: A Review for the 21st-century clinician

W Rojas-Carabali, R Agrawal… - Asia-Pacific Journal of …, 2024 - Elsevier
ABSTRACT Natural Language Processing (NLP) is a subfield of artificial intelligence that
focuses on the interaction between computers and human language, enabling computers to …

Using Large Language Models to Promote Health Equity

E Pierson, D Shanmugam, R Movva, J Kleinberg… - NEJM AI, 2025 - ai.nejm.org
While the discussion about the effects of large language models (LLMs) on health equity has
been largely cautionary, LLMs also present significant opportunities for improving health …

The potential of Generative Pre-trained Transformer 4 (GPT-4) to analyse medical notes in three different languages: a retrospective model-evaluation study

MCS Menezes, AF Hoffmann, ALM Tan… - The Lancet Digital …, 2025 - thelancet.com
Background Patient notes contain substantial information but are difficult for computers to
analyse due to their unstructured format. Large-language models (LLMs), such as …

A systematic assessment of openai o1-preview for higher order thinking in education

E Latif, Y Zhou, S Guo, Y Gao, L Shi… - arxiv preprint arxiv …, 2024 - arxiv.org
As artificial intelligence (AI) continues to advance, it demonstrates capabilities comparable
to human intelligence, with significant potential to transform education and workforce …

Image biomarkers and explainable AI: handcrafted features versus deep learned features

L Rundo, C Militello - European Radiology Experimental, 2024 - Springer
Feature extraction and selection from medical data are the basis of radiomics and image
biomarker discovery for various architectures, including convolutional neural networks …