The rise and potential of large language model based agents: A survey

Z Xi, W Chen, X Guo, W He, Y Ding, B Hong… - Science China …, 2025 - Springer
For a long time, researchers have sought artificial intelligence (AI) that matches or exceeds
human intelligence. AI agents, which are artificial entities capable of sensing the …

Dissociating language and thought in large language models

K Mahowald, AA Ivanova, IA Blank, N Kanwisher… - Trends in Cognitive …, 2024 - cell.com
Large language models (LLMs) have come closest among all models to date to mastering
human language, yet opinions about their linguistic and cognitive capabilities remain split …

Reasoning with language model prompting: A survey

S Qiao, Y Ou, N Zhang, X Chen, Y Yao, S Deng… - arXiv preprint arXiv …, 2022 - arxiv.org
Reasoning, as an essential ability for complex problem-solving, can provide back-end
support for various real-world applications, such as medical diagnosis, negotiation, etc. This …

Foundational challenges in assuring alignment and safety of large language models

U Anwar, A Saparov, J Rando, D Paleka… - arXiv preprint arXiv …, 2024 - arxiv.org
This work identifies 18 foundational challenges in assuring the alignment and safety of large
language models (LLMs). These challenges are organized into three different categories …

Exploring collaboration mechanisms for LLM agents: A social psychology view

J Zhang, X Xu, N Zhang, R Liu, B Hooi… - arXiv preprint arXiv …, 2023 - arxiv.org
As Natural Language Processing (NLP) systems are increasingly employed in intricate
social environments, a pressing query emerges: Can these NLP systems mirror human …

Exploring large language models for communication games: An empirical study on werewolf

Y Xu, S Wang, P Li, F Luo, X Wang, W Liu… - arXiv preprint arXiv …, 2023 - arxiv.org
Communication games, which we refer to as incomplete information games that heavily
depend on natural language communication, hold significant research value in fields such …

Understanding social reasoning in language models with language models

K Gandhi, JP Fränken… - Advances in Neural …, 2024 - proceedings.neurips.cc
As Large Language Models (LLMs) become increasingly integrated into our
everyday lives, understanding their ability to comprehend human mental states becomes …

Evaluating large language models in theory of mind tasks

M Kosinski - Proceedings of the National Academy of Sciences, 2024 - pnas.org
Eleven large language models (LLMs) were assessed using 40 bespoke false-belief tasks,
considered a gold standard in testing theory of mind (ToM) in humans. Each task included a …

ARB: Advanced reasoning benchmark for large language models

T Sawada, D Paleka, A Havrilla, P Tadepalli… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) have demonstrated remarkable performance on various
quantitative reasoning and knowledge benchmarks. However, many of these benchmarks …

Minding language models' (lack of) theory of mind: A plug-and-play multi-character belief tracker

M Sclar, S Kumar, P West, A Suhr, Y Choi… - arXiv preprint arXiv …, 2023 - arxiv.org
Theory of Mind (ToM), the ability to reason about the mental states of
other people, is a key element of our social intelligence. Yet, despite …