Hallucination detection in foundation models for decision-making: A flexible definition and review of the state of the art
Autonomous systems are soon to be ubiquitous, spanning manufacturing, agriculture,
healthcare, entertainment, and other industries. Most of these systems are developed with …
healthcare, entertainment, and other industries. Most of these systems are developed with …
Confident adaptive language modeling
Recent advances in Transformer-based large language models (LLMs) have led to
significant performance improvements across many tasks. These gains come with a drastic …
significant performance improvements across many tasks. These gains come with a drastic …
A gentle introduction to conformal prediction and distribution-free uncertainty quantification
Black-box machine learning models are now routinely used in high-risk settings, like
medical diagnostics, which demand uncertainty quantification to avoid consequential model …
medical diagnostics, which demand uncertainty quantification to avoid consequential model …
Image-to-image regression with distribution-free uncertainty quantification and applications in imaging
Image-to-image regression is an important learning task, used frequently in biological
imaging. Current algorithms, however, do not generally offer statistical guarantees that …
imaging. Current algorithms, however, do not generally offer statistical guarantees that …
Conformal risk control
We extend conformal prediction to control the expected value of any monotone loss function.
The algorithm generalizes split conformal prediction together with its coverage guarantee …
The algorithm generalizes split conformal prediction together with its coverage guarantee …
Improved online conformal prediction via strongly adaptive online learning
We study the problem of uncertainty quantification via prediction sets, in an online setting
where the data distribution may vary arbitrarily over time. Recent work develops online …
where the data distribution may vary arbitrarily over time. Recent work develops online …
Conformal language modeling
We propose a novel approach to conformal prediction for generative language models
(LMs). Standard conformal prediction produces prediction sets--in place of single predictions …
(LMs). Standard conformal prediction produces prediction sets--in place of single predictions …
Conformal prediction: A gentle introduction
Black-box machine learning models are now routinely used in high-risk settings, like
medical diagnostics, which demand uncertainty quantification to avoid consequential model …
medical diagnostics, which demand uncertainty quantification to avoid consequential model …
Reliable visual question answering: Abstain rather than answer incorrectly
Abstract Machine learning has advanced dramatically, narrowing the accuracy gap to
humans in multimodal tasks like visual question answering (VQA). However, while humans …
humans in multimodal tasks like visual question answering (VQA). However, while humans …
Robust calibration with multi-domain temperature scaling
Uncertainty quantification is essential for the reliable deployment of machine learning
models to high-stakes application domains. Uncertainty quantification is all the more …
models to high-stakes application domains. Uncertainty quantification is all the more …