Challenges and applications of large language models

J Kaddour, J Harris, M Mozes, H Bradley… - arxiv preprint arxiv …, 2023 - arxiv.org
Large Language Models (LLMs) went from non-existent to ubiquitous in the machine
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …

Language model behavior: A comprehensive survey

TA Chang, BK Bergen - Computational Linguistics, 2024 - direct.mit.edu
Transformer language models have received widespread public attention, yet their
generated text is often surprising even to NLP researchers. In this survey, we discuss over …

Distilling reasoning capabilities into smaller language models

K Shridhar, A Stolfo, M Sachan - arxiv preprint arxiv:2212.00193, 2022 - arxiv.org
Step-by-step reasoning approaches like chain of thought (CoT) have proved to be very
effective in inducing reasoning capabilities in large language models. However, the success …

Evaluating large language models: A comprehensive survey

Z Guo, R **, C Liu, Y Huang, D Shi, L Yu, Y Liu… - arxiv preprint arxiv …, 2023 - arxiv.org
Large language models (LLMs) have demonstrated remarkable capabilities across a broad
spectrum of tasks. They have attracted significant attention and been deployed in numerous …

Large language models for mathematical reasoning: Progresses and challenges

J Ahn, R Verma, R Lou, D Liu, R Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org
Mathematical reasoning serves as a cornerstone for assessing the fundamental cognitive
capabilities of human intelligence. In recent times, there has been a notable surge in the …

A mechanistic interpretation of arithmetic reasoning in language models using causal mediation analysis

A Stolfo, Y Belinkov, M Sachan - arxiv preprint arxiv:2305.15054, 2023 - arxiv.org
Mathematical reasoning in large language models (LMs) has garnered significant attention
in recent work, but there is a limited understanding of how these models process and store …

SemEval-2024 task 2: Safe biomedical natural language inference for clinical trials

M Jullien, M Valentino, A Freitas - arxiv preprint arxiv:2404.04963, 2024 - arxiv.org
Large Language Models (LLMs) are at the forefront of NLP achievements but fall short in
dealing with shortcut learning, factual inconsistency, and vulnerability to adversarial inputs …

Autonomous GIS: the next-generation AI-powered GIS

Z Li, H Ning - International Journal of Digital Earth, 2023 - Taylor & Francis
ABSTRACT Large Language Models (LLMs), such as ChatGPT, demonstrate a strong
understanding of human natural language and have been explored and applied in various …

Cladder: Assessing causal reasoning in language models

Z **, Y Chen, F Leeb, L Gresele, O Kamal… - … conference on neural …, 2023 - openreview.net
The ability to perform causal reasoning is widely considered a core feature of intelligence. In
this work, we investigate whether large language models (LLMs) can coherently reason …

Causal prompting: Debiasing large language model prompting based on front-door adjustment

C Zhang, L Zhang, J Wu, Y He, D Zhou - arxiv preprint arxiv:2403.02738, 2024 - arxiv.org
Despite the notable advancements of existing prompting methods, such as In-Context
Learning and Chain-of-Thought for Large Language Models (LLMs), they still face …