Two-stage learning to defer with multiple experts

A Mao, C Mohri, M Mohri… - Advances in neural …, 2023 - proceedings.neurips.cc
We study a two-stage scenario for learning to defer with multiple experts, which is crucial in
practice for many applications. In this scenario, a predictor is derived in a first stage by …

Structured prediction with stronger consistency guarantees

A Mao, M Mohri, Y Zhong - Advances in Neural Information …, 2023 - proceedings.neurips.cc
We present an extensive study of surrogate losses for structured prediction supported by* $
H $-consistency bounds*. These are recently introduced guarantees that are more relevant …

Theoretically grounded loss functions and algorithms for score-based multi-class abstention

A Mao, M Mohri, Y Zhong - International Conference on …, 2024 - proceedings.mlr.press
Learning with abstention is a key scenario where the learner can abstain from making a
prediction at some cost. In this paper, we analyze the score-based formulation of learning …

-Consistency Bounds: Characterization and Extensions

A Mao, M Mohri, Y Zhong - Advances in Neural Information …, 2024 - proceedings.neurips.cc
A series of recent publications by Awasthi et al. have introduced the key notion of* $ H $-
consistency bounds* for surrogate loss functions. These are upper bounds on the zero-one …

Predictor-rejector multi-class abstention: Theoretical analysis and algorithms

A Mao, M Mohri, Y Zhong - International Conference on …, 2024 - proceedings.mlr.press
We study the key framework of learning with abstention in the multi-class classification
setting. In this setting, the learner can choose to abstain from making a prediction with some …

Learning to reject with a fixed predictor: Application to decontextualization

C Mohri, D Andor, E Choi, M Collins - arxiv preprint arxiv:2301.09044, 2023 - arxiv.org
We study the problem of classification with a reject option for a fixed predictor, applicable in
natural language processing. We introduce a new problem formulation for this scenario, and …

Learning to Defer to a Population: A Meta-Learning Approach

D Tailor, A Patra, R Verma… - International …, 2024 - proceedings.mlr.press
The learning to defer (L2D) framework allows autonomous systems to be safe and robust by
allocating difficult decisions to a human expert. All existing work on L2D assumes that each …

Multi-label learning with stronger consistency guarantees

A Mao, M Mohri, Y Zhong - arxiv preprint arxiv:2407.13746, 2024 - arxiv.org
We present a detailed study of surrogate losses and algorithms for multi-label learning,
supported by $ H $-consistency bounds. We first show that, for the simplest form of multi …

A universal growth rate for learning with smooth surrogate losses

A Mao, M Mohri, Y Zhong - arxiv preprint arxiv:2405.05968, 2024 - arxiv.org
This paper presents a comprehensive analysis of the growth rate of $ H $-consistency
bounds (and excess error bounds) for various surrogate losses used in classification. We …

Top- Classification and Cardinality-Aware Prediction

A Mao, M Mohri, Y Zhong - arxiv preprint arxiv:2403.19625, 2024 - arxiv.org
We present a detailed study of top-$ k $ classification, the task of predicting the $ k $ most
probable classes for an input, extending beyond single-class prediction. We demonstrate …