Testing and evaluation of health care applications of large language models: a systematic review

S Bedi, Y Liu, L Orr-Ewing, D Dash, S Koyejo… - JAMA, 2024 - jamanetwork.com
Importance: Large language models (LLMs) can assist in various health care activities, but
current evaluation approaches may not adequately identify the most useful application …

A systematic review of testing and evaluation of healthcare applications of large language models (LLMs)

S Bedi, Y Liu, L Orr-Ewing, D Dash, S Koyejo… - medRxiv, 2024 - medrxiv.org
Importance: Large Language Models (LLMs) can assist in a wide range of
healthcare-related activities. Current approaches to evaluating LLMs make it difficult to …

Interpretability of AI race detection model in medical imaging with saliency methods

S Konate, L Lebrat, R Santa Cruz, JW Gichoya… - Computational and …, 2025 - Elsevier
Deep neural networks (DNNs) are powerful tools for classifying images. Using these
convolutional models for medical images is challenging due to their complexity and large …