Medical artificial intelligence and human values

KH Yu, E Healey, TY Leong, IS Kohane… - New England Journal …, 2024‏ - Mass Medical Soc
Key Points Medical Artificial Intelligence and Human Values As large language models and
other artificial intelligence models are used more in medicine, ethical dilemmas can arise …

A survey of large language models in medicine: Progress, application, and challenge

H Zhou, F Liu, B Gu, X Zou, J Huang, J Wu, Y Li… - arxiv preprint arxiv …, 2023‏ - arxiv.org
Large language models (LLMs), such as ChatGPT, have received substantial attention due
to their capabilities for understanding and generating human language. While there has …

Large language model influence on diagnostic reasoning: a randomized clinical trial

E Goh, R Gallo, J Hom, E Strong, Y Weng… - JAMA Network …, 2024‏ - jamanetwork.com
Importance Large language models (LLMs) have shown promise in their performance on
both multiple-choice and open-ended medical reasoning examinations, but it remains …

Rethinking interpretability in the era of large language models

C Singh, JP Inala, M Galley, R Caruana… - arxiv preprint arxiv …, 2024‏ - arxiv.org
Interpretable machine learning has exploded as an area of interest over the last decade,
sparked by the rise of increasingly large datasets and deep neural networks …

Tribulations and future opportunities for artificial intelligence in precision medicine

C Carini, AA Seyhan - Journal of Translational Medicine, 2024‏ - Springer
Upon a diagnosis, the clinical team faces two main questions: what treatment, and at what
dose? Clinical trials' results provide the basis for guidance and support for official protocols …

Building machines that learn and think with people

KM Collins, I Sucholutsky, U Bhatt, K Chandra… - Nature human …, 2024‏ - nature.com
What do we want from machine intelligence? We envision machines that are not just tools
for thought but partners in thought: reasonable, insightful, knowledgeable, reliable and …

An evaluation framework for clinical use of large language models in patient interaction tasks

S Johri, J Jeong, BA Tran, DI Schlessinger… - Nature Medicine, 2025‏ - nature.com
The integration of large language models (LLMs) into clinical diagnostics has the potential to
transform doctor–patient interactions. However, the readiness of these models for real-world …

A future role for health applications of large language models depends on regulators enforcing safety standards

O Freyer, IC Wiest, JN Kather, S Gilbert - The Lancet Digital Health, 2024‏ - thelancet.com
Among the rapid integration of artificial intelligence in clinical settings, large language
models (LLMs), such as Generative Pre-trained Transformer-4, have emerged as …

Medbench: A large-scale chinese benchmark for evaluating medical large language models

Y Cai, L Wang, Y Wang, G de Melo, Y Zhang… - Proceedings of the …, 2024‏ - ojs.aaai.org
The emergence of various medical large language models (LLMs) in the medical domain
has highlighted the need for unified evaluation standards, as manual evaluation of LLMs …

AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments

S Schmidgall, R Ziaei, C Harris, E Reis… - arxiv preprint arxiv …, 2024‏ - arxiv.org
Evaluating large language models (LLM) in clinical scenarios is crucial to assessing their
potential clinical utility. Existing benchmarks rely heavily on static question-answering, which …