Enhancing the reliability and accuracy of AI-enabled diagnosis via complementarity-driven deferral to clinicians

K Dvijotham, J Winkens, M Barsbey, S Ghaisas… - Nature Medicine, 2023 - nature.com
Predictive artificial intelligence (AI) systems based on deep learning have been shown to
achieve expert-level identification of diseases in multiple medical imaging settings, but can …

Two-stage learning to defer with multiple experts

A Mao, C Mohri, M Mohri… - Advances in neural …, 2024 - proceedings.neurips.cc
We study a two-stage scenario for learning to defer with multiple experts, which is crucial in
practice for many applications. In this scenario, a predictor is derived in a first stage by …

Theoretically grounded loss functions and algorithms for score-based multi-class abstention

A Mao, M Mohri, Y Zhong - International Conference on …, 2024 - proceedings.mlr.press
Learning with abstention is a key scenario where the learner can abstain from making a
prediction at some cost. In this paper, we analyze the score-based formulation of learning …

Predictor-rejector multi-class abstention: Theoretical analysis and algorithms

A Mao, M Mohri, Y Zhong - International Conference on …, 2024 - proceedings.mlr.press
We study the key framework of learning with abstention in the multi-class classification
setting. In this setting, the learner can choose to abstain from making a prediction with some …

In defense of softmax parametrization for calibrated and consistent learning to defer

Y Cao, H Mozannar, L Feng… - Advances in Neural …, 2024 - proceedings.neurips.cc
Enabling machine learning classifiers to defer their decision to a downstream expert when
the expert is more accurate will ensure improved safety and performance. This objective can …

Principled approaches for learning to defer with multiple experts

A Mao, M Mohri, Y Zhong - International Workshop on Combinatorial …, 2024 - Springer
We present a study of surrogate losses and algorithms for the general problem of learning to
defer with multiple experts. We first introduce a new family of surrogate losses specifically …

When does confidence-based cascade deferral suffice?

W Jitkrittum, N Gupta, AK Menon… - Advances in …, 2024 - proceedings.neurips.cc
Cascades are a classical strategy to enable inference cost to vary adaptively across
samples, wherein a sequence of classifiers are invoked in turn. A deferral rule determines …

Learning to defer to multiple experts: Consistent surrogate losses, confidence calibration, and conformal ensembles

R Verma, D Barrejón… - … Conference on Artificial …, 2023 - proceedings.mlr.press
We study the statistical properties of learning to defer (L2D) to multiple experts. In particular,
we address the open problems of deriving a consistent surrogate loss, confidence …

Plan-Then-Execute: An Empirical Study of User Trust and Team Performance When Using LLM Agents As A Daily Assistant

G He, G Demartini, U Gadiraju - arxiv preprint arxiv:2502.01390, 2025 - arxiv.org
Since the explosion in popularity of ChatGPT, large language models (LLMs) have
continued to impact our everyday lives. Equipped with external tools that are designed for a …

Learning to Defer to a Population: A Meta-Learning Approach

D Tailor, A Patra, R Verma… - International …, 2024 - proceedings.mlr.press
The learning to defer (L2D) framework allows autonomous systems to be safe and robust by
allocating difficult decisions to a human expert. All existing work on L2D assumes that each …