Collaborating with humans without human data
Collaborating with humans requires rapidly adapting to their individual strengths,
weaknesses, and preferences. Unfortunately, most standard multi-agent reinforcement …
weaknesses, and preferences. Unfortunately, most standard multi-agent reinforcement …
Maximum entropy population-based training for zero-shot human-ai coordination
We study the problem of training a Reinforcement Learning (RL) agent that is collaborative
with humans without using human data. Although such agents can be obtained through self …
with humans without using human data. Although such agents can be obtained through self …
Learning zero-shot cooperation with humans, assuming humans are biased
There is a recent trend of applying multi-agent reinforcement learning (MARL) to train an
agent that can cooperate with humans in a zero-shot fashion without using any human data …
agent that can cooperate with humans in a zero-shot fashion without using any human data …
Cooperative open-ended learning framework for zero-shot coordination
Zero-shot coordination in cooperative artificial intelligence (AI) remains a significant
challenge, which means effectively coordinating with a wide range of unseen partners …
challenge, which means effectively coordinating with a wide range of unseen partners …
Semantically aligned task decomposition in multi-agent reinforcement learning
The difficulty of appropriately assigning credit is particularly heightened in cooperative
MARL with sparse reward, due to the concurrent time and structural scales involved …
MARL with sparse reward, due to the concurrent time and structural scales involved …
Quantifying the effects of environment and population diversity in multi-agent reinforcement learning
Generalization is a major challenge for multi-agent reinforcement learning. How well does
an agent perform when placed in novel environments and in interactions with new co …
an agent perform when placed in novel environments and in interactions with new co …
Tackling cooperative incompatibility for zero-shot human-ai coordination
Securing coordination between AI agent and teammates (human players or AI agents) in
contexts involving unfamiliar humans continues to pose a significant challenge in Zero-Shot …
contexts involving unfamiliar humans continues to pose a significant challenge in Zero-Shot …
The boltzmann policy distribution: Accounting for systematic suboptimality in human models
Models of human behavior for prediction and collaboration tend to fall into two categories:
ones that learn from large amounts of data via imitation learning, and ones that assume …
ones that learn from large amounts of data via imitation learning, and ones that assume …
Coach: Cooperative robot teaching
Abstract Knowledge and skills can transfer from human teachers to human students.
However, such direct transfer is often not scalable for physical tasks, as they require one-to …
However, such direct transfer is often not scalable for physical tasks, as they require one-to …
Human-ai shared control via policy dissection
Human-AI shared control allows human to interact and collaborate with autonomous agents
to accomplish control tasks in complex environments. Previous Reinforcement Learning …
to accomplish control tasks in complex environments. Previous Reinforcement Learning …