Emergent tool use from multi-agent autocurricula

B Baker, I Kanitscheider, T Markov, Y Wu… - International …, 2019 - openreview.net
Through multi-agent competition, the simple objective of hide-and-seek, and standard
reinforcement learning algorithms at scale, we find that agents create a self-supervised …

Phyre: A new benchmark for physical reasoning

A Bakhtin, L van der Maaten… - Advances in …, 2019 - proceedings.neurips.cc
Understanding and reasoning about physics is an important ability of intelligent agents. We
develop the PHYRE benchmark for physical reasoning that contains a set of simple classical …

Making language models better tool learners with execution feedback

S Qiao, H Gui, C Lv, Q Jia, H Chen, N Zhang - arxiv preprint arxiv …, 2023 - arxiv.org
Tools serve as pivotal interfaces that enable humans to understand and reshape the
environment. With the advent of foundation models, AI systems can utilize tools to expand …

A bayesian-symbolic approach to reasoning and learning in intuitive physics

K Xu, A Srivastava, D Gutfreund… - Advances in neural …, 2021 - proceedings.neurips.cc
Humans can reason about intuitive physics in fully or partially observed environments even
after being exposed to a very limited set of observations. This sample-efficient intuitive …

Generalization to new actions in reinforcement learning

A Jain, A Szot, JJ Lim - arxiv preprint arxiv:2011.01928, 2020 - arxiv.org
A fundamental trait of intelligence is the ability to achieve goals in the face of novel
circumstances, such as making decisions from new action choices. However, standard …

Creative problem solving in artificially intelligent agents: A survey and framework

E Gizzi, L Nair, S Chernova, J Sinapov - Journal of Artificial Intelligence …, 2022 - jair.org
Abstract Creative Problem Solving (CPS) is a sub-area within Artificial Intelligence (AI) that
focuses on methods for solving off-nominal, or anomalous problems in autonomous …

Temporal and state abstractions for efficient learning, transfer, and composition in humans.

L **a, AGE Collins - Psychological review, 2021 - psycnet.apa.org
Humans use prior knowledge to efficiently solve novel tasks, but how they structure past
knowledge during learning to enable such fast generalization is not well understood. We …

Bongard-logo: A new benchmark for human-level concept learning and reasoning

W Nie, Z Yu, L Mao, AB Patel, Y Zhu… - Advances in Neural …, 2020 - proceedings.neurips.cc
Humans have an inherent ability to learn novel concepts from only a few samples and
generalize these concepts to different situations. Even though today's machine learning …

Forward prediction for physical reasoning

R Girdhar, L Gustafson, A Adcock… - arxiv preprint arxiv …, 2020 - arxiv.org
Physical reasoning requires forward prediction: the ability to forecast what will happen next
given some initial world state. We study the performance of state-of-the-art forward …

Learning to build physical structures better over time

W McCarthy, D Kirsh, J Fan - … of the Annual Meeting of the …, 2020 - escholarship.org
Our ability to plan and build a wide array of physicalstructures, from sand castles to
skyscrapers, is a definingfeature of modern human intelligence. What cognitive toolsenable …