The TRIPOD-LLM reporting guideline for studies using large language models

J Gallifant, M Afshar, S Ameen, Y Aphinyanaphongs… - Nature Medicine, 2025 - nature.com
Large language models (LLMs) are rapidly being adopted in healthcare, necessitating
standardized reporting guidelines. We present transparent reporting of a multivariable …

A comprehensive survey of large language models and multimodal large language models in medicine

H **ao, F Zhou, X Liu, T Liu, Z Li, X Liu, X Huang - Information Fusion, 2024 - Elsevier
Since the release of ChatGPT and GPT-4, large language models (LLMs) and multimodal
large language models (MLLMs) have attracted widespread attention for their exceptional …

GPT-4 assistance for improvement of physician performance on patient care tasks: a randomized controlled trial

E Goh, RJ Gallo, E Strong, Y Weng, H Kerman… - Nature Medicine, 2025 - nature.com
While large language models (LLMs) have shown promise in diagnostic reasoning, their
impact on management reasoning, which involves balancing treatment decisions and …

Superhuman performance of a large language model on the reasoning tasks of a physician

PG Brodeur, TA Buckley, Z Kanjee, E Goh… - arxiv preprint arxiv …, 2024 - arxiv.org
Performance of large language models (LLMs) on medical tasks has traditionally been
evaluated using multiple choice question benchmarks. However, such benchmarks are …

Large language models improve clinical decision making of medical students through patient simulation and structured feedback: a randomized controlled trial

E Brügge, S Ricchizzi, M Arenbeck, MN Keller… - BMC Medical …, 2024 - Springer
Background Clinical decision-making (CDM) refers to physicians' ability to gather, evaluate,
and interpret relevant diagnostic information. An integral component of CDM is the medical …

Establishing best practices in large language model research: an application to repeat prompting

RJ Gallo, M Baiocchi, TR Savage… - Journal of the American …, 2025 - academic.oup.com
Objectives We aimed to demonstrate the importance of establishing best practices in large
language model research, using repeat prompting as an illustrative example. Materials and …

Multiple large language models versus experienced physicians in diagnosing challenging cases with gastrointestinal symptoms

X Yang, T Li, H Wang, R Zhang, Z Ni, N Liu, H Zhai… - npj Digital …, 2025 - nature.com
Faced with challenging cases, doctors are increasingly seeking diagnostic advice from large
language models (LLMs). This study aims to compare the ability of LLMs and human …

Evaluation and Regulation of Artificial Intelligence Medical Devices for Clinical Decision Support

GE Weissman - Annual Review of Biomedical Data Science, 2025 - annualreviews.org
Artificial intelligence (AI) methods were first developed nearly seven decades ago. Only in
recent years have they demonstrated their potential to improve clinical care at the bedside …

[HTML][HTML] Voice EHR: introducing multimodal audio data for health

J Anibal, H Huth, M Li, L Hazen, V Daoud… - Frontiers in Digital …, 2025 - pmc.ncbi.nlm.nih.gov
Introduction Artificial intelligence (AI) models trained on audio data may have the potential to
rapidly perform clinical tasks, enhancing medical decision-making and potentially improving …

Artificial intelligence in clinical genetics

D Duong, BD Solomon - European Journal of Human Genetics, 2025 - nature.com
Artificial intelligence (AI) has been growing more powerful and accessible, and will
increasingly impact many areas, including virtually all aspects of medicine and biomedical …