Mdagents: An adaptive collaboration of llms for medical decision-making

Y Kim, C Park, H Jeong, YS Chan… - Advances in …, 2025 - proceedings.neurips.cc
Foundation models are becoming valuable tools in medicine. Yet despite their promise, the
best way to leverage Large Language Models (LLMs) in complex medical tasks remains an …

Large language models for disease diagnosis: A sco** review

S Zhou, Z Xu, M Zhang, C Xu, Y Guo, Z Zhan… - arxiv preprint arxiv …, 2024 - arxiv.org
Automatic disease diagnosis has become increasingly valuable in clinical practice. The
advent of large language models (LLMs) has catalyzed a paradigm shift in artificial …

The art of saying no: Contextual noncompliance in language models

F Brahman, S Kumar, V Balachandran, P Dasigi… - arxiv preprint arxiv …, 2024 - arxiv.org
Chat-based language models are designed to be helpful, yet they should not comply with
every user request. While most existing work primarily focuses on refusal of" unsafe" …

Uncertainty of thoughts: Uncertainty-aware planning enhances information seeking in LLMs

Z Hu, C Liu, X Feng, Y Zhao, SK Ng, AT Luu… - The Thirty-eighth …, 2024 - openreview.net
In the face of uncertainty, the ability to* seek information* is of fundamental importance. In
many practical applications, such as medical diagnosis and troubleshooting, the information …

Generative ai in medicine

D Shanmugam, M Agrawal, R Movva, IY Chen… - arxiv preprint arxiv …, 2024 - arxiv.org
The increased capabilities of generative AI have dramatically expanded its possible use
cases in medicine. We provide a comprehensive overview of generative AI use cases for …

Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators

Z Liu, Q Tu, W Ye, Y **ao, Z Zhang, H Cui… - arxiv preprint arxiv …, 2025 - arxiv.org
Online medical consultation (OMC) restricts doctors to gathering patient information solely
through inquiries, making the already complex sequential decision-making process of …

Baichuan-M1: Pushing the Medical Capability of Large Language Models

B Wang, H Zhao, H Zhou, L Song, M Xu… - arxiv preprint arxiv …, 2025 - arxiv.org
The current generation of large language models (LLMs) is typically designed for broad,
general-purpose applications, while domain-specific LLMs, especially in vertical fields like …

Limitations of Large Language Models in Clinical Problem-Solving Arising from Inflexible Reasoning

J Kim, A Podlasek, K Shidara, F Liu, A Alaa… - arxiv preprint arxiv …, 2025 - arxiv.org
Large Language Models (LLMs) have attained human-level accuracy on medical question-
answer (QA) benchmarks. However, their limitations in navigating open-ended clinical …

Recursive Decomposition of Logical Thoughts: Framework for Superior Reasoning and Knowledge Propagation in Large Language Models

KU Qasim, J Zhang, T Alsahfi, AUR Butt - arxiv preprint arxiv:2501.02026, 2025 - arxiv.org
Enhancing the reasoning capabilities of Large Language Models remains a critical
challenge in artificial intelligence. We introduce RDoLT, Recursive Decomposition of Logical …

Critique of Impure Reason: Unveiling the reasoning behaviour of medical Large Language Models

S Sim, T Chen - arxiv preprint arxiv:2412.15748, 2024 - arxiv.org
Background: Despite the current ubiquity of Large Language Models (LLMs) across the
medical domain, there is a surprising lack of studies which address their reasoning …