Maven: Multi-agent variational exploration
Centralised training with decentralised execution is an important setting for cooperative
deep multi-agent reinforcement learning due to communication constraints during execution …
deep multi-agent reinforcement learning due to communication constraints during execution …
Rode: Learning roles to decompose multi-agent tasks
Role-based learning holds the promise of achieving scalable multi-agent learning by
decomposing complex tasks using roles. However, it is largely unclear how to efficiently …
decomposing complex tasks using roles. However, it is largely unclear how to efficiently …
Uneven: Universal value exploration for multi-agent reinforcement learning
VDN and QMIX are two popular value-based algorithms for cooperative MARL that learn a
centralized action value function as a monotonic mixing of per-agent utilities. While this …
centralized action value function as a monotonic mixing of per-agent utilities. While this …
Tesseract: Tensorised actors for multi-agent reinforcement learning
Reinforcement Learning in large action spaces is a challenging problem. This is especially
true for cooperative multi-agent reinforcement learning (MARL), which often requires …
true for cooperative multi-agent reinforcement learning (MARL), which often requires …
Virel: A variational inference framework for reinforcement learning
Applying probabilistic models to reinforcement learning (RL) enables the uses of powerful
optimisation tools such as variational inference in RL. However, existing inference …
optimisation tools such as variational inference in RL. However, existing inference …
Invariant transform experience replay: Data augmentation for deep reinforcement learning
Deep Reinforcement Learning (RL) is a promising approach for adaptive robot control, but
its current application to robotics is currently hindered by high sample requirements. To …
its current application to robotics is currently hindered by high sample requirements. To …
Addressing imperfect symmetry: a novel symmetry-learning actor-critic extension
Symmetry, a fundamental concept to understand our environment, often oversimplifies
reality from a mathematical perspective. Humans are a prime example, deviating from …
reality from a mathematical perspective. Humans are a prime example, deviating from …
Knowledge-guided exploration in deep reinforcement learning
This paper proposes a new method to drastically speed up deep reinforcement learning
(deep RL) training for problems that have the property of state-action permissibility (SAP) …
(deep RL) training for problems that have the property of state-action permissibility (SAP) …
[PDF][PDF] Action permissibility in deep reinforcement learning and application to autonomous driving
This paper is concerned with deep reinforcement learning (deep RL) in continuous state and
action space. It proposes a new method that can drastically speed up RL training for …
action space. It proposes a new method that can drastically speed up RL training for …
[HTML][HTML] Addressing imperfect symmetry: A novel symmetry-learning actor-critic extension
Symmetry, a fundamental concept to understand our environment, often oversimplifies
reality from a mathematical perspective. Humans are a prime example, deviating from …
reality from a mathematical perspective. Humans are a prime example, deviating from …