The rise and potential of large language model based agents: A survey
For a long time, researchers have sought artificial intelligence (AI) that matches or exceeds
human intelligence. AI agents, which are artificial entities capable of sensing the …
human intelligence. AI agents, which are artificial entities capable of sensing the …
Dissociating language and thought in large language models
Large language models (LLMs) have come closest among all models to date to mastering
human language, yet opinions about their linguistic and cognitive capabilities remain split …
human language, yet opinions about their linguistic and cognitive capabilities remain split …
Reasoning with language model prompting: A survey
Reasoning, as an essential ability for complex problem-solving, can provide back-end
support for various real-world applications, such as medical diagnosis, negotiation, etc. This …
support for various real-world applications, such as medical diagnosis, negotiation, etc. This …
Foundational challenges in assuring alignment and safety of large language models
This work identifies 18 foundational challenges in assuring the alignment and safety of large
language models (LLMs). These challenges are organized into three different categories …
language models (LLMs). These challenges are organized into three different categories …
Exploring collaboration mechanisms for llm agents: A social psychology view
As Natural Language Processing (NLP) systems are increasingly employed in intricate
social environments, a pressing query emerges: Can these NLP systems mirror human …
social environments, a pressing query emerges: Can these NLP systems mirror human …
Exploring large language models for communication games: An empirical study on werewolf
Communication games, which we refer to as incomplete information games that heavily
depend on natural language communication, hold significant research value in fields such …
depend on natural language communication, hold significant research value in fields such …
Understanding social reasoning in language models with language models
Abstract As Large Language Models (LLMs) become increasingly integrated into our
everyday lives, understanding their ability to comprehend human mental states becomes …
everyday lives, understanding their ability to comprehend human mental states becomes …
Evaluating large language models in theory of mind tasks
M Kosinski - Proceedings of the National Academy of Sciences, 2024 - pnas.org
Eleven large language models (LLMs) were assessed using 40 bespoke false-belief tasks,
considered a gold standard in testing theory of mind (ToM) in humans. Each task included a …
considered a gold standard in testing theory of mind (ToM) in humans. Each task included a …
Arb: Advanced reasoning benchmark for large language models
Large Language Models (LLMs) have demonstrated remarkable performance on various
quantitative reasoning and knowledge benchmarks. However, many of these benchmarks …
quantitative reasoning and knowledge benchmarks. However, many of these benchmarks …
Minding language models'(lack of) theory of mind: A plug-and-play multi-character belief tracker
Theory of Mind (ToM) $\unicode {x2014} $ the ability to reason about the mental states of
other people $\unicode {x2014} $ is a key element of our social intelligence. Yet, despite …
other people $\unicode {x2014} $ is a key element of our social intelligence. Yet, despite …