Challenges and applications of large language models
Large Language Models (LLMs) went from non-existent to ubiquitous in the machine
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …
A survey of large language models
Language is essentially a complex, intricate system of human expressions governed by
grammatical rules. It poses a significant challenge to develop capable AI algorithms for …
grammatical rules. It poses a significant challenge to develop capable AI algorithms for …
Toolqa: A dataset for llm question answering with external tools
Abstract Large Language Models (LLMs) have demonstrated impressive performance in
various NLP tasks, but they still suffer from challenges such as hallucination and weak …
various NLP tasks, but they still suffer from challenges such as hallucination and weak …
Pal: Program-aided language models
Large language models (LLMs) have demonstrated an impressive ability to perform
arithmetic and symbolic reasoning tasks, when provided with a few examples at test time (" …
arithmetic and symbolic reasoning tasks, when provided with a few examples at test time (" …
Evoprompting: Language models for code-level neural architecture search
Given the recent impressive accomplishments of language models (LMs) for code
generation, we explore the use of LMs as general adaptive mutation and crossover …
generation, we explore the use of LMs as general adaptive mutation and crossover …
Faithful chain-of-thought reasoning
While Chain-of-Thought (CoT) prompting boosts Language Models'(LM) performance on a
gamut of complex reasoning tasks, the generated reasoning chain does not necessarily …
gamut of complex reasoning tasks, the generated reasoning chain does not necessarily …
Teaching arithmetic to small transformers
Large language models like GPT-4 exhibit emergent capabilities across general-purpose
tasks, such as basic arithmetic, when trained on extensive text data, even though these tasks …
tasks, such as basic arithmetic, when trained on extensive text data, even though these tasks …
Large language models are few-shot health learners
Large language models (LLMs) can capture rich representations of concepts that are useful
for real-world tasks. However, language alone is limited. While existing LLMs excel at text …
for real-world tasks. However, language alone is limited. While existing LLMs excel at text …
Evaluating and improving tool-augmented computation-intensive math reasoning
Chain-of-thought prompting (CoT) and tool augmentation have been validated in recent
work as effective practices for improving large language models (LLMs) to perform step-by …
work as effective practices for improving large language models (LLMs) to perform step-by …
Towards faithful model explanation in nlp: A survey
End-to-end neural Natural Language Processing (NLP) models are notoriously difficult to
understand. This has given rise to numerous efforts towards model explainability in recent …
understand. This has given rise to numerous efforts towards model explainability in recent …