The ethics of ChatGPT in medicine and healthcare: a systematic review on Large Language Models (LLMs)

J Haltaufderheide, R Ranisch - npj Digital Medicine, 2024 - nature.com
With the introduction of ChatGPT, Large Language Models (LLMs) have received enormous
attention in healthcare. Despite potential benefits, researchers have underscored various …

A future role for health applications of large language models depends on regulators enforcing safety standards

O Freyer, IC Wiest, JN Kather, S Gilbert - The Lancet Digital Health, 2024 - thelancet.com
Among the rapid integration of artificial intelligence in clinical settings, large language
models (LLMs), such as Generative Pre-trained Transformer-4, have emerged as …

A toolbox for surfacing health equity harms and biases in large language models

SR Pfohl, H Cole-Lewis, R Sayres, D Neal, M Asiedu… - Nature Medicine, 2024 - nature.com
Large language models (LLMs) hold promise to serve complex health information needs but
also have the potential to introduce harm and exacerbate health disparities. Reliably …

Adapted large language models can outperform medical experts in clinical text summarization

D Van Veen, C Van Uden, L Blankemeier… - Nature Medicine, 2024 - nature.com
Analyzing vast textual data and summarizing key information from electronic health records
imposes a substantial burden on how clinicians allocate their time. Although large language …

Closing the gap between open source and commercial large language models for medical evidence summarization

G Zhang, Q Jin, Y Zhou, S Wang, B Idnay, Y Luo… - npj Digital …, 2024 - nature.com
Large language models (LLMs) hold great promise in summarizing medical evidence. Most
recent studies focus on the application of proprietary LLMs. Using proprietary LLMs …

Clinical text summarization: adapting large language models can outperform human experts

D Van Veen, C Van Uden, L Blankemeier… - Research …, 2023 - ncbi.nlm.nih.gov
Sifting through vast textual data and summarizing key information from electronic health
records (EHR) imposes a substantial burden on how clinicians allocate their time. Although …

Large language model–based responses to patients' in-basket messages

WR Small, B Wiesenfeld, B Brandfield-Harvey… - JAMA network …, 2024 - jamanetwork.com
Importance: Virtual patient-physician communications have increased since 2020 and
negatively impacted primary care physician (PCP) well-being. Generative artificial …

Demographic bias in misdiagnosis by computational pathology models

A Vaidya, RJ Chen, DFK Williamson, AH Song… - Nature Medicine, 2024 - nature.com
Despite increasing numbers of regulatory approvals, deep learning-based computational
pathology systems often overlook the impact of demographic factors on performance …

The TRIPOD-LLM reporting guideline for studies using large language models

J Gallifant, M Afshar, S Ameen, Y Aphinyanaphongs… - Nature Medicine, 2025 - nature.com
Large language models (LLMs) are rapidly being adopted in healthcare, necessitating
standardized reporting guidelines. We present transparent reporting of a multivariable …

Preventing harm from non-conscious bias in medical generative AI

J Hastings - The Lancet Digital Health, 2024 - thelancet.com
Large language models such as OpenAI's GPT-4 have the potential to transform medicine [1]
by enabling automation of a range of tasks, including writing discharge summaries, [2] …