A survey on large language model based autonomous agents

L Wang, C Ma, X Feng, Z Zhang, H Yang… - Frontiers of Computer …, 2024 - Springer
Autonomous agents have long been a research focus in academic and industry
communities. Previous research often focuses on training agents with limited knowledge …

Whose opinions do language models reflect?

S Santurkar, E Durmus, F Ladhak… - International …, 2023 - proceedings.mlr.press
Language models (LMs) are increasingly being used in open-ended contexts,
where the opinions they reflect in response to subjective queries can have a profound …

Artificial intelligence and illusions of understanding in scientific research

L Messeri, MJ Crockett - Nature, 2024 - nature.com
Scientists are enthusiastically imagining ways in which artificial intelligence (AI) tools might
improve research. Why are AI tools so attractive and what are the risks of implementing them …

Can AI language models replace human participants?

D Dillion, N Tandon, Y Gu, K Gray - Trends in Cognitive Sciences, 2023 - cell.com
Recent work suggests that language models such as GPT can make human-like judgments
across a number of domains. We explore whether and when language models might …

Using large language models in psychology

D Demszky, D Yang, DS Yeager, CJ Bryan… - Nature Reviews …, 2023 - nature.com
Large language models (LLMs), such as OpenAI's GPT-4, Google's Bard or Meta's LLaMa,
have created unprecedented opportunities for analysing and generating language data on a …

Large language models as simulated economic agents: What can we learn from homo silicus?

JJ Horton - 2023 - nber.org
Newly-developed large language models (LLM)—because of how they are trained and
designed—are implicit computational models of humans—a homo silicus. LLMs can be …

Using large language models to simulate multiple humans and replicate human subject studies

GV Aher, RI Arriaga, AT Kalai - International Conference on …, 2023 - proceedings.mlr.press
We introduce a new type of test, called a Turing Experiment (TE), for evaluating to what
extent a given language model, such as GPT models, can simulate different aspects of …