Challenges and applications of large language models

J Kaddour, J Harris, M Mozes, H Bradley… - arxiv preprint arxiv …, 2023 - arxiv.org
Large Language Models (LLMs) went from non-existent to ubiquitous in the machine
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …

Dissociating language and thought in large language models

K Mahowald, AA Ivanova, IA Blank, N Kanwisher… - Trends in cognitive …, 2024 - cell.com
Large language models (LLMs) have come closest among all models to date to mastering
human language, yet opinions about their linguistic and cognitive capabilities remain split …

Evaluation and mitigation of the limitations of large language models in clinical decision-making

P Hager, F Jungmann, R Holland, K Bhagat… - Nature medicine, 2024 - nature.com
Clinical decision-making is one of the most impactful parts of a physician's responsibilities
and stands to benefit greatly from artificial intelligence solutions and large language models …

Large language models as commonsense knowledge for large-scale task planning

Z Zhao, WS Lee, D Hsu - Advances in Neural Information …, 2023 - proceedings.neurips.cc
Large-scale task planning is a major challenge. Recent work exploits large language
models (LLMs) directly as a policy and shows surprisingly interesting results. This paper …

Position: LLMs can't plan, but can help planning in LLM-modulo frameworks

S Kambhampati, K Valmeekam, L Guan… - … on Machine Learning, 2024 - openreview.net
We argue that auto-regressive LLMs cannot, by themselves, do planning or self-verification
(which is after all a form of reasoning), and shed some light on the reasons for …

Evaluating correctness and faithfulness of instruction-following models for question answering

V Adlakha, P BehnamGhader, XH Lu… - Transactions of the …, 2024 - direct.mit.edu
Instruction-following models are attractive alternatives to fine-tuned approaches for question
answering (QA). By simply prepending relevant documents and an instruction to their input …

Generative AI for economic research: Use cases and implications for economists

A Korinek - Journal of Economic Literature, 2023 - aeaweb.org
Generative artificial intelligence (AI) has the potential to revolutionize research. I analyze
how large language models (LLMs) such as ChatGPT can assist economists by describing …

Foundational challenges in assuring alignment and safety of large language models

U Anwar, A Saparov, J Rando, D Paleka… - arxiv preprint arxiv …, 2024 - arxiv.org
This work identifies 18 foundational challenges in assuring the alignment and safety of large
language models (LLMs). These challenges are organized into three different categories …

Embers of autoregression: Understanding large language models through the problem they are trained to solve

RT McCoy, S Yao, D Friedman, M Hardy… - arxiv preprint arxiv …, 2023 - arxiv.org
The widespread adoption of large language models (LLMs) makes it important to recognize
their strengths and limitations. We argue that in order to develop a holistic understanding of …

What algorithms can transformers learn? a study in length generalization

H Zhou, A Bradley, E Littwin, N Razin, O Saremi… - arxiv preprint arxiv …, 2023 - arxiv.org
Large language models exhibit surprising emergent generalization properties, yet also
struggle on many simple reasoning tasks such as arithmetic and parity. This raises the …