- Academic Search

X Yin, B Gao, X Yu - Annual Reviews in Control, 2024 - Elsevier

In recent years, formal methods have been extensively used in the design of autonomous
systems. By employing mathematically rigorous techniques, formal methods can provide …

Zapisz Cytuj Cytowane przez 16 Powiązane artykuły Wszystkie wersje 5

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Trustworthy llms: a survey and guideline for evaluating large language models' alignment

Y Liu, Y Yao, JF Ton, X Zhang, R Guo, H Cheng… - arxiv preprint arxiv …, 2023 - arxiv.org

Ensuring alignment, which refers to making models behave in accordance with human
intentions [1, 2], has become a critical task before deploying large language models (LLMs) …

Zapisz Cytuj Cytowane przez 281 Powiązane artykuły Wszystkie wersje 3 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Hallucination detection in foundation models for decision-making: A flexible definition and review of the state of the art

N Chakraborty, M Ornik, K Driggs-Campbell - ACM Computing Surveys, 2025 - dl.acm.org

Autonomous systems are soon to be ubiquitous, spanning manufacturing, agriculture,
healthcare, entertainment, and other industries. Most of these systems are developed with …

Zapisz Cytuj Cytowane przez 7 Powiązane artykuły Wszystkie wersje 4

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Robots that ask for help: Uncertainty alignment for large language model planners

AZ Ren, A Dixit, A Bodrova, S Singh, S Tu… - arxiv preprint arxiv …, 2023 - arxiv.org

Large language models (LLMs) exhibit a wide range of promising capabilities--from step-by-
step planning to commonsense reasoning--that may provide utility for robots, but remain …

Zapisz Cytuj Cytowane przez 201 Powiązane artykuły Wszystkie wersje 7 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Benchmarking llms via uncertainty quantification

F Ye, M Yang, J Pang, L Wang, DF Wong… - arxiv preprint arxiv …, 2024 - arxiv.org

The proliferation of open-source Large Language Models (LLMs) from various institutions
has highlighted the urgent need for comprehensive evaluation methods. However, current …

Zapisz Cytuj Cytowane przez 46 Powiązane artykuły Wszystkie wersje 4 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] thelancet.com Full View

The diagnostic and triage accuracy of the GPT-3 artificial intelligence model: an observational study

DM Levine, R Tuwani, B Kompa, A Varma… - The Lancet Digital …, 2024 - thelancet.com

Background Artificial intelligence (AI) applications in health care have been effective in
many areas of medicine, but they are often trained for a single task using labelled data …

Zapisz Cytuj Cytowane przez 16 Powiązane artykuły Wszystkie wersje 5

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Conformal alignment: Knowing when to trust foundation models with guarantees

Y Gui, Y **, Z Ren - arxiv preprint arxiv:2405.10301, 2024 - arxiv.org

Before deploying outputs from foundation models in high-stakes tasks, it is imperative to
ensure that they align with human values. For instance, in radiology report generation …

Zapisz Cytuj Cytowane przez 15 Powiązane artykuły Wszystkie wersje 5 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Api is enough: Conformal prediction for large language models without logit-access

J Su, J Luo, H Wang, L Cheng - arxiv preprint arxiv:2403.01216, 2024 - arxiv.org

This study aims to address the pervasive challenge of quantifying uncertainty in large
language models (LLMs) without logit-access. Conformal Prediction (CP), known for its …

Zapisz Cytuj Cytowane przez 14 Powiązane artykuły Wszystkie wersje 4 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Large language model validity via enhanced conformal prediction methods

JJ Cherian, I Gibbs, EJ Candès - arxiv preprint arxiv:2406.09714, 2024 - arxiv.org

We develop new conformal inference methods for obtaining validity guarantees on the
output of large language models (LLMs). Prior work in conformal language modeling …

Zapisz Cytuj Cytowane przez 14 Powiązane artykuły Wszystkie wersje 3 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] mit.edu

Conformal prediction for natural language processing: A survey

M Campos, A Farinhas, C Zerva… - Transactions of the …, 2024 - direct.mit.edu

The rapid proliferation of large language models and natural language processing (NLP)
applications creates a crucial need for uncertainty quantification to mitigate risks such as …

Zapisz Cytuj Cytowane przez 5 Powiązane artykuły Wszystkie wersje 4

Utwórz alert

Cytuj

Szukanie zaawansowane

Zapisano w Mojej bibliotece

Conformal prediction with large language models for multi-choice question answering

Formal synthesis of controllers for safety-critical autonomous systems: Developments and challenges

Trustworthy llms: a survey and guideline for evaluating large language models' alignment

Hallucination detection in foundation models for decision-making: A flexible definition and review of the state of the art

Robots that ask for help: Uncertainty alignment for large language model planners

Benchmarking llms via uncertainty quantification

The diagnostic and triage accuracy of the GPT-3 artificial intelligence model: an observational study

Conformal alignment: Knowing when to trust foundation models with guarantees

Api is enough: Conformal prediction for large language models without logit-access

Large language model validity via enhanced conformal prediction methods

Conformal prediction for natural language processing: A survey