Formal synthesis of controllers for safety-critical autonomous systems: Developments and challenges

X Yin, B Gao, X Yu - Annual Reviews in Control, 2024 - Elsevier
In recent years, formal methods have been extensively used in the design of autonomous
systems. By employing mathematically rigorous techniques, formal methods can provide …

Trustworthy LLMs: a survey and guideline for evaluating large language models' alignment

Y Liu, Y Yao, JF Ton, X Zhang, R Guo, H Cheng… - arXiv preprint arXiv …, 2023 - arxiv.org
Ensuring alignment, which refers to making models behave in accordance with human
intentions [1, 2], has become a critical task before deploying large language models (LLMs) …

Hallucination detection in foundation models for decision-making: A flexible definition and review of the state of the art

N Chakraborty, M Ornik, K Driggs-Campbell - ACM Computing Surveys, 2025 - dl.acm.org
Autonomous systems are soon to be ubiquitous, spanning manufacturing, agriculture,
healthcare, entertainment, and other industries. Most of these systems are developed with …

Robots that ask for help: Uncertainty alignment for large language model planners

AZ Ren, A Dixit, A Bodrova, S Singh, S Tu… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) exhibit a wide range of promising capabilities--from
step-by-step planning to commonsense reasoning--that may provide utility for robots, but remain …

Benchmarking LLMs via uncertainty quantification

F Ye, M Yang, J Pang, L Wang, DF Wong… - arXiv preprint arXiv …, 2024 - arxiv.org
The proliferation of open-source Large Language Models (LLMs) from various institutions
has highlighted the urgent need for comprehensive evaluation methods. However, current …

The diagnostic and triage accuracy of the GPT-3 artificial intelligence model: an observational study

DM Levine, R Tuwani, B Kompa, A Varma… - The Lancet Digital …, 2024 - thelancet.com
Background: Artificial intelligence (AI) applications in health care have been effective in
many areas of medicine, but they are often trained for a single task using labelled data …

Conformal alignment: Knowing when to trust foundation models with guarantees

Y Gui, Y Jin, Z Ren - arXiv preprint arXiv:2405.10301, 2024 - arxiv.org
Before deploying outputs from foundation models in high-stakes tasks, it is imperative to
ensure that they align with human values. For instance, in radiology report generation …

API is enough: Conformal prediction for large language models without logit-access

J Su, J Luo, H Wang, L Cheng - arXiv preprint arXiv:2403.01216, 2024 - arxiv.org
This study aims to address the pervasive challenge of quantifying uncertainty in large
language models (LLMs) without logit-access. Conformal Prediction (CP), known for its …

Large language model validity via enhanced conformal prediction methods

JJ Cherian, I Gibbs, EJ Candès - arXiv preprint arXiv:2406.09714, 2024 - arxiv.org
We develop new conformal inference methods for obtaining validity guarantees on the
output of large language models (LLMs). Prior work in conformal language modeling …

Conformal prediction for natural language processing: A survey

M Campos, A Farinhas, C Zerva… - Transactions of the …, 2024 - direct.mit.edu
The rapid proliferation of large language models and natural language processing (NLP)
applications creates a crucial need for uncertainty quantification to mitigate risks such as …