A survey of language model confidence estimation and calibration

J Geng, F Cai, Y Wang, H Koeppl, P Nakov… - ar** and ECV analysis from native and post-contrast cardiac T1 map** images using Bayesian vision …
TW Arega, S Bricq, F Legrand, A Jacquier… - Medical image …, 2023 - Elsevier
Deep learning-based methods for cardiac MR segmentation have achieved state-of-the-art
results. However, these methods can generate incorrect segmentation results which can …

Benchmarking uncertainty quantification methods for large language models with lm-polygraph

R Vashurin, E Fadeeva, A Vazhentsev… - arxiv preprint arxiv …, 2024 - arxiv.org
Uncertainty quantification (UQ) is a critical component of machine learning (ML)
applications. The rapid proliferation of large language models (LLMs) has stimulated …