Reliable, adaptable, and attributable language models with retrieval

A Asai, Z Zhong, D Chen, PW Koh… - ar** powerful general-purpose agents,
wherein Foundation Models are used as modules within agentic systems (eg Chain-of …

Internal consistency and self-feedback in large language models: A survey

X Liang, S Song, Z Zheng, H Wang, Q Yu, X Li… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs) often exhibit deficient reasoning or generate hallucinations.
To address these, studies prefixed with" Self-" such as Self-Consistency, Self-Improve, and …

DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing

S Shankar, T Chambers, T Shah… - arxiv preprint arxiv …, 2024 - arxiv.org
Analyzing unstructured data has been a persistent challenge in data processing. Large
Language Models (LLMs) have shown promise in this regard, leading to recent proposals …

How to correctly do semantic backpropagation on language-based agentic systems

W Wang, HA Alyahya, DR Ashley, O Serikov… - arxiv preprint arxiv …, 2024 - arxiv.org
Language-based agentic systems have shown great promise in recent years, transitioning
from solving small-scale research problems to being deployed in challenging real-world …

Aviary: training language agents on challenging scientific tasks

S Narayanan, JD Braza, RR Griffiths… - arxiv preprint arxiv …, 2024 - arxiv.org
Solving complex real-world tasks requires cycles of actions and observations. This is
particularly true in science, where tasks require many cycles of analysis, tool use, and …

Fast inference for augmented large language models

R Shahout, C Liang, S **n, Q Lao, Y Cui, M Yu… - arxiv preprint arxiv …, 2024 - arxiv.org
Augmented Large Language Models (LLMs) enhance the capabilities of standalone LLMs
by integrating external data sources through API calls. In interactive LLM applications …

Beyond the Comfort Zone: Emerging Solutions to Overcome Challenges in Integrating LLMs into Software Products

N Nahar, C Kästner, J Butler, C Parnin… - arxiv preprint arxiv …, 2024 - arxiv.org
Large Language Models (LLMs) are increasingly embedded into software products across
diverse industries, enhancing user experiences, but at the same time introducing numerous …

AIME: AI System Optimization via Multiple LLM Evaluators

B Patel, S Chakraborty, WA Suttle, M Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
Text-based AI system optimization typically involves a feedback loop scheme where a single
LLM generates an evaluation in natural language of the current output to improve the next …

Task Facet Learning: A Structured Approach to Prompt Optimization

G Juneja, N Natarajan, H Li, J Jiao… - arxiv preprint arxiv …, 2024 - arxiv.org
Given a task in the form of a basic description and its training examples, prompt optimization
is the problem of synthesizing the given information into a text prompt for a large language …