- Academic Search

Z **, W Chen, X Guo, W He, Y Ding, B Hong… - Science China …, 2025 - Springer

For a long time, researchers have sought artificial intelligence (AI) that matches or exceeds
human intelligence. AI agents, which are artificial entities capable of sensing the …

Save Cite Cited by 735 Related articles All 4 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Dissociating language and thought in large language models

K Mahowald, AA Ivanova, IA Blank, N Kanwisher… - Trends in Cognitive …, 2024 - cell.com

Large language models (LLMs) have come closest among all models to date to mastering
human language, yet opinions about their linguistic and cognitive capabilities remain split …

Save Cite Cited by 434 Related articles All 10 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Reasoning with language model prompting: A survey

S Qiao, Y Ou, N Zhang, X Chen, Y Yao, S Deng… - arxiv preprint arxiv …, 2022 - arxiv.org

Reasoning, as an essential ability for complex problem-solving, can provide back-end
support for various real-world applications, such as medical diagnosis, negotiation, etc. This …

Save Cite Cited by 257 Related articles All 6 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Foundational challenges in assuring alignment and safety of large language models

U Anwar, A Saparov, J Rando, D Paleka… - arxiv preprint arxiv …, 2024 - arxiv.org

This work identifies 18 foundational challenges in assuring the alignment and safety of large
language models (LLMs). These challenges are organized into three different categories …

Save Cite Cited by 120 Related articles All 3 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Exploring collaboration mechanisms for llm agents: A social psychology view

J Zhang, X Xu, N Zhang, R Liu, B Hooi… - arxiv preprint arxiv …, 2023 - arxiv.org

As Natural Language Processing (NLP) systems are increasingly employed in intricate
social environments, a pressing query emerges: Can these NLP systems mirror human …

Save Cite Cited by 95 Related articles All 4 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Exploring large language models for communication games: An empirical study on werewolf

Y Xu, S Wang, P Li, F Luo, X Wang, W Liu… - arxiv preprint arxiv …, 2023 - arxiv.org

Communication games, which we refer to as incomplete information games that heavily
depend on natural language communication, hold significant research value in fields such …

Save Cite Cited by 147 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

Understanding social reasoning in language models with language models

K Gandhi, JP Fränken… - Advances in Neural …, 2024 - proceedings.neurips.cc

Abstract As Large Language Models (LLMs) become increasingly integrated into our
everyday lives, understanding their ability to comprehend human mental states becomes …

Save Cite Cited by 89 Related articles All 10 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] pnas.org

Evaluating large language models in theory of mind tasks

M Kosinski - Proceedings of the National Academy of Sciences, 2024 - pnas.org

Eleven large language models (LLMs) were assessed using 40 bespoke false-belief tasks,
considered a gold standard in testing theory of mind (ToM) in humans. Each task included a …

Save Cite Cited by 43 Related articles

[Free GPT-4]

[PDF] arxiv.org

Arb: Advanced reasoning benchmark for large language models

T Sawada, D Paleka, A Havrilla, P Tadepalli… - arxiv preprint arxiv …, 2023 - arxiv.org

Large Language Models (LLMs) have demonstrated remarkable performance on various
quantitative reasoning and knowledge benchmarks. However, many of these benchmarks …

Save Cite Cited by 58 Related articles All 6 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Minding language models'(lack of) theory of mind: A plug-and-play multi-character belief tracker

M Sclar, S Kumar, P West, A Suhr, Y Choi… - arxiv preprint arxiv …, 2023 - arxiv.org

Theory of Mind (ToM) $\unicode {x2014} $ the ability to reason about the mental states of
other people $\unicode {x2014} $ is a key element of our social intelligence. Yet, despite …

Save Cite Cited by 59 Related articles All 6 versions Free GPT-4 View as HTML

Create alert

Cite

Advanced search

Saved to My library

Clever hans or neural theory of mind? stress testing social reasoning in large language models

The rise and potential of large language model based agents: A survey

Dissociating language and thought in large language models

Reasoning with language model prompting: A survey

Foundational challenges in assuring alignment and safety of large language models

Exploring collaboration mechanisms for llm agents: A social psychology view

Exploring large language models for communication games: An empirical study on werewolf

Understanding social reasoning in language models with language models

Evaluating large language models in theory of mind tasks

Arb: Advanced reasoning benchmark for large language models

Minding language models'(lack of) theory of mind: A plug-and-play multi-character belief tracker